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PREFACE 


This book is a development of two lectures delivered at the Harvard 
Tercentenary Conference of Arts and Sciences in the fall of 1936. The first 
of these was published in Vol. 44 of the American Mathematical Monthly, 
and is reprinted here, as Lecture I, without change. The second has 
expanded gradually until it fills the rest of the book. 

I have given many lectures on Ramanujan’s work since 1936, isolated 
lectures to a number of universities and societies in America and England, 
and connected courses in Princeton and Cambridge. Lectures TI-XII 
contain most of the substance of these courses, with the rearrangements 
and additions required to fit them for publication. In this sense they are 
genuine loctures, and they are written throughout in a lecturer’s style. 

The contents of the book are described quite accurately by its title. It 
is not a systematic account of Ramanujan’s work (though most of his more 
important discoveries are mentioned somewhere), but a series of essays 
suggested by it. In each essay I have taken some part of his work as my 
text, and have said what occurred to me about its relations to that of 
earlier and later writers. But even when I digress furthest, when I am 
writing, for example, about Rademacher’s work in Lecture VIII, or about 
Rankin’s in Lecture XI, ‘Ramanujan’ is the thread which holds the whole 
together. 

Dr R. A. Rankin has read the whole of the book both in manuscript and 
in proof, and has made a very large number of important suggestions and 
corrections. I have also to thank Dr W. N. Bailey, who helped me to 
revise Lecture VII; Mr F. M. Goodspeed, who read and criticised several 
lectures, and in particular Lecture XI; and Prof. G. N. Watson, without 
whose aid I could hardly have written Lecture XII. The photograph of 
Ramanujan was given me by Dr S. Chandrasekhar, formerly Fellow of 
Trinity College: I regret that I cannot state the name of the actual 
photographer. A considerable part of the bibliography was compiled for 
me by Dr V. Levin. But my first thanks are due to Harvard University, 
to whose invitation the book owes its existence. 

G. H. H. 


July 1940 
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I 


THE INDIAN MATHEMATICIAN RAMANUJAN 


I have set myself a task in these lectures which is genuinely difficult and 
which, if I were determined to begin by making every excuse for failure, 
I might represent as almost impossible. I have to form myself, as I have 
never really formed before, and to try to help you to form, some sort of 
reasoned estimate of the most romantic figure in the recent history of mathe- 
matics; a man whose career seems full of paradoxes and contradictions, who 
defies almost all the canons by which we are accustomed to judge one 
another, and about whom all of us will probably agree in one judgment only, 
that he was in some sense a very great mathematician. 

The difficulties in judging Ramanujan are obvious and formidable 
enough. Ramanujan was an Indian, and I suppose that it is always a little 
difficult for an Englishman and an Indian to understand one another 
properly. He was, at the best, a half-educated Indian; he never had the 
advantages, such as they are, of an orthodox Indian training; he never was 
able to pass the “First Arts Examination” of an Indian university, and 
never could rise even to be a ‘‘ Failed B.A.” He worked, for most of his life, 
in practically complete ignorance of modern European mathematics, and 
died when he was a little over thirty and when his mathematical education 
had in some ways hardly begun. He published abundantly—his published 
papers make a volume of nearly 400 pages—but he also left a mass of un- 
published work which had never been analysed properly until the last few 
years. This work includes a great deal that is new, but much more that is 
rediscovery, and often imperfect rediscovery; and it is sometimes still 
impossible to distinguish between what he must have rediscovered and what 
he may somehow have learnt. I cannot imagine anybody saying with any 
confidence, even now, just how great a mathematician he was and still less 
how great a mathematician he might have been. 

These are genuine difficulties, but I think that we shall find some of them 
less formidable than they look, and the difficulty which is the greatest for 
me has nothing to do with the obvious paradoxes of Ramanujan’s career. 
The real difficulty for me is that Ramanujan was, in a way, my discovery. 
I did not invent him—like other great men, he invented himself—but I was 
the first really competent person who had the chance,to see some of his work, 
and I can still remember with satisfaction that I could recognise at once 
what a treasure J had found. And I suppose that I still know more of 
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Ramanujan than any one else, and am still the first authority on this 
particular subject. There are other people in England, Professor Watson in 
particular, and Professor Mordell, who know parts of his work very much 
better than I do, but neither Watson nor Mordell knew Ramanujan himself 
as I did. I saw him and talked with him almost every day for several years, 
and above all I actually collaborated with him. I owe more to him than to 
anyone else in the world with one exception, and my association with him 
is the one romantic incident in my life. The difficulty for me then is not that 
I do not know enough about him, but that I know and feel too much and 
that I simply cannot be impartial. 

I rely, for the facts of Ramanujan’s life, on Seshu Aiyar and Rama- 
chaundra Rao, whose memoir of Ramanujan is printed, along with my own, 
in his Collected Papers. He was born in 1887 in a Brahmin family at Erode 
near Kumbakonam, a fair-sized town,in the Tanjore district of the Presi- 
dency of Madras. His father was a clerk in a cloth-merchant’s office in 
Kumbakonam, and all his relatives, though of high caste, were very poor. 

He was sent at seven to the High School of Kumbakonam, and remained 
there nine years. His exceptional abilities had begun to show themselves 
before he was ten, and by the time that he was twelve or thirteen he was 
recognised as a quite abnormal boy. His biographers tell some curious 
stories of his early years. They say, for example, that soon after he had begun 
the study of trigonometry, he discovered for himself ‘‘Euler’s theorems for 
the sine and cosine” (by which I understand the relations between the 
circular and exponential functions), and was very disappointed when he 
found later, apparently from the second volume of Loney’s Trigonometry, 
that they were known already. Until he was sixteen he had never seen a 
mathematical book of any higher class. Whittaker’s Modern analysis had 
not yet spread so far, and Bromwich’s Injinite series did not exist. There 
can be no doubt that either of these books would have made a tremendous 
difference to him if they could have come his way. It was a book of a very 
different kind, Carr’s Synopsis, which first aroused Ramanujan’s full 
powers. 

Carr's book (A synopsis of elementary results in pure and applied mathe- 
matics, by George Shoobridge Carr, formerly Scholar of Gonville and Caius 
College, Cambridge, published in two volumes in 1880 and 1886) is almost 
unprocurable now. There is a copy in the Cambridge University Library, 
and there happened to be one in the library of the Government College of 
Kumbakonam, which was borrowed for Ramanujan by a friend. The book 
is not in any sense a great one, but Ramanujan has made it famous, and 
there is no doubt that it influenced him profoundly and that his acquaintance 
with it marked the real starting-point of his career. Such a book must have 
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had its qualities, and Carr’s, if not a book of any high distinction, is no mere 
third-rate textbook, but a book written with some real scholarship and 
enthusiasm and with a style and individuality of its own. Carr himself was 
a private coach in London, who came to Cambridge as an undergraduate 
when he was nearly forty, and was 12th Senior Optime in the Mathematical 
Tripos of 1880 (the same year in which he published the first volume of his 
book). He is now completely forgotten, even in his own college, except in 
so far as Ramanujan has kept his name alive; but he must have been in 
some ways rather a remarkable man. 

I suppose that the book is substantially a summary of Carr’s coaching 
notes. If you were a pupil of Carr, you worked through the appropriate 
sections of the Synopsis. It covers roughly the subjects of Schedule A of the 
present Tripos (as these subjects were understood in Cambridge in 1880), 
and is effectively the “synopsis”’ it professes to be. It contains the enuncia- 
tions of 6165 theorems, systematically and quite scientifically arranged, with 
proofs which are often little more than cross-references and are decidedly 
the least interesting part of the book. All this is exaggerated in Ramanujan’s 
famous notebooks (which contain practically no proofs at all), and any 
student of the notebooks can see that Ramanujan’s ideal of presentation 
had been copied from Carr’s. 

Carr has sections on the obvious subjects, algebra, trigonometry, calculus 
and analytical geometry, but some sections are developed disproportionally, 
and particularly the formal side of the intcgral calculus. This seems to have 
been Carr’s pet subject, and the treatment of it is very full and in its way 
definitely good. There is no theory of functions; and I very much doubt 
whether Ramanujan, to the end of his life, ever understood at all clearly 
what an analytic function is. What is more surprising, in view of Carr’s own 
tastes and Ramanujan’s later work, is that there is nothing about elliptic 
functions. However Ramanujan may have acquired his very peculiar 
knowledge of this theory, it was not from Carr. 

On the whole, considered as an inspiration for a boy of such abnormal 
gifts, Carr was not too bad, and Ramanujan responded amazingly. 

Through the new world thus opencd to him (say his Indian biographers),* 
Ramanujan went ranging with delight. It was this book which awakened his 
genius. He set himself to establish the formulae given therein. As he was without 
the aid of other books, each solution was a piece of research so far as he was 
concerned,... Ramanujan used to say that the goddess of Namakkal inspired 
him with the formulae in dreams. It is a remarkable fact that frequently, on 


rising from bed, he would note down results and rapidly verify them, though 
he was not always able to supply a rigorous proof. ... 


* Quotations (except those from my own memoir of Ramanujan) are from Seshu 
Aiyar and Ramachaundra Rao. 
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T have quoted the last sentences deliberately, not because I attach any 
importance to them—I am no more interested in the goddess of Namakkal 
than you are—but because we are now approaehing the difficult and tragic 
part of Ramanujan’s career, and we must try to understand what we can 
of his psychology and of the atmosphere surrounding him in his early years. 

T am sure that Ramanujan was no mystic and that religion, except in a 
strictly material sense, played no important part in his life. He was an 
orthodox high-caste Hindu, and always adhered (indeed with a severity 
most unusual in Indians rosident in England) to all the observances of his 
caste. He had promiscd his parents to do so, and he kept his promises to 
the letter. He was a. vegetarian in the strictest sense—this proved a terrible 
difficulty later when he fell ill—and all the time he was in Cambridge he 
cooked all his food himself, and never cooked it without first changing into 
pyjamas. 

Now the two memoirs of Ramanujan printed in the Papers (and both 
written by men who, in their different ways, knew him very well) contradict 
one another flatly about his religion. Seshu Aiyar and Ramachaundra 
Rao say 


Ramanujan had definite religious views. He had a special veneration for the 
Namakkal goddess....He believed in the existence of a Supreme Being and in 
the attainment of Godhead by men....He had scttled convictions about the 
problem of life and after...; 


while I say 


.. his religion was a matter of observance and not of intelloctual conviction, 
and I remember well his telling me (much to my surprise) that all religions 
seemed to him more or less equally true.... 3 


Which of us is right? For my part I have no doubt at all; I am quite 
certain that I am. 

Classical scholars have, I believe, a general principle, difficilior lectio 
potior—the more difficult reading is to be preferred—in textual criticism. 
If the Archbishop of Canterbury tells one man that he’ believes in God, 
and another that he does not, then it is probably the second assertion which 
is true, since otherwise it is very difficult to understand why he should have 
made it, while there are many excellent reagons for his niaking the first 
whether it be true or false. Similarly, if a strict Brahmin like Ramanujan 
told me, as he certainly did, that he had no definite beliefs, then it is 100 to 1 
that he meant what he said. 

This was no sufficient reason why Ramanujan should outrage the feelings 
of his parents or his Indian friends. He was not a reasoned infidel, but an 


* The Archbishop. 
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“aonostic” in its strict sense, who saw no particular good, and no particular 
harm, in Hinduism or in any other religion. Hinduism is, far more, for 
example, than Christianity, a religion of observance, in which belief counts 
for extremely little in any case, and, if Ramanujan’s friends assumed that 
he accepted the conventional doctrines of such a religion, and he did not 
disillusion them, he was practising a quite harmless, and probably necessary, 
economy of truth. 

This question of Ramanujan’s religion is not itself important, but it is 
not altogether irrelevant, because there is one thing which I am really 
anxious to insist upon as strongly as I can. There is quite enough about 
Ramanujan that is difficult to understand, and we have no need to go out 
of our way to manufacture mystery. For myself, I liked and admired him 
enough to wish to be a rationalist about him; and I want to makeit quite clear 
to you that Ramanujan, when he was living in Cambridge in good health 
and comfortable surroundings, was, in spite of his oddities, as reasonable, 
as sane, and in his way as shrewd a person as anyone here. The last thing 
which I want you to do is to throw up your hands and exclaim ‘“‘here is 
something unintelligible, some mysterious manifestation of the immemorial 
wisdom of the East!’’ I do not believe in the immemorial wisdom of the 
East, and the picture which I want to present to you is that of a man who 
had his peculiarities like other distinguished men, but a man in whose 
society one could take pleasure, with whom one could drink tea and discuss 
politics or mathematics; the picture in short, not of a wonder from the East, 
or an inspired idiot, or a psychological freak, but of a rational human being 
who happened to be a great mathematician. 

Until he was about seventeen, all went well with Ramanujan. 


In Decemher 1903 he passed the Matriculation Examination of the University 
of Madras, and in the January of the succeeding year he joined the Junior First 
in Arts class of the Government College, Kumbakonam, and won the Subrah- 
manyam scholarship, which is gencrally awarded for proficiency in English and 
Mathematics..., 


but after this there came a series of tragic checks. 


By this time, he was so absorbed in the study of Mathematics that in all 
lecture hours—whether devoted to English, History, or Physiology—he used to 
engage himself in some mathematical investigation, unmindful of what was 
happening in the class. This excessive devotion to mathematics and his conse- 
quent neglect of the other subjects resulted in his failure to secure promotion to 
the senior class and in the consequent discontinuance of the scholarship. Partly 
owing to disappointment and partly owing to the influence of a friend, he ran 
away northward into the Telugu country, hut returned to Kumhakonam after 
some wandering and rejoined the college. As owing to his absence he failed to 
make sufficient attendances to ohtain his term certificate in 1905, he entered 
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Pachaiyappa’s College, Madras, in 1906, but falling ill rcturncd to Kumba- 
konam. He appeared as a private student for the F.A. examination of December 
1907 and failed.... 


Ramanujan does not seem to have had any definite occupation, except 
mathematics, until 1912. In 1909 he married, and it becaine necessary for 
him to have some regular employment, but he had great diffienlty in finding 
any because of his unfortunate college career. About 1910 he began to find 
more influential Indian fricnds, Ramaswauii Aiyar and his two biographers, 
but all their efforts to find a tolerablo position for him failed, and in 1912 
he became a, clerk in the office of the Port Trust of Madras, at a salary of 
about £30 a year. He was then nearly twenty-five. The years between 
eighteen and twenty-five are the critical ycars in a mathematician’s career, 
and the damage had been done. Ramanujan’s genius never had again its 
ehance of full development. 

There is not much to say about the rest of Ramanujan’s life. His first 
substantial paper had been published in 1911, and in 1912 lis exceptional 
powers began to be understood. It is significant that, though Indians could 
befriend him, it was only the English who could get anything effective done. 
Sir Francis Spring and Sir Gilbert Walker obtained a special scholarship 
for him, £60 a year, sufficient for a married Indian to live in tolerable 
comfort. At the beginning of 1913 he wrote to ine, and Professor Neville 
and I, after many difficulties, got him to England in 1914. Here he had 
three years of uninterrupted activity, the results of which you can read in 
the Papers. He fell ill in the summer of 1917, and never really recovered, 
though he continued to work, rather spasmodically, but with no real sign 
of degeneration, until his death in 1920. He became a Fellow of the Royal 
Society early in 1918, and a Fellow of Trinity College, Cambridge, later in 
the same year (and was the first Indian elected to either society). His last 
mathematical letter on ‘“Mock-Theta functions”, the subject of Professor 
Watson’s presidential address to the London Mathematical Society last 
year, was written about two months before he died. 

The real tragedy about Ramanujan was not his early death. It is of course 
a disaster that any great man should die young, but a mathematician is 
often comparatively old at thirty, and his death may be less of a catastrophe 
than it seems. Abel died at twenty-six and, although he would no doubt 
have added a great deal more to mathematics, he could hardly have become 
a greater man. The tragedy of Ramanujan was not that he died young, but 
that, during his five unfortunate years, his genius was misdirected, side- 
tracked, and to a certain extent distorted. 

I have been looking again through what I wrote about Ramanujan 
sixteen years ago, and, although I know his work a good deal better now than 
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I did then, and can think about him more dispassionately, I do not find a 
great deal which I should particularly want to alter. But there is just one 
sentence which now seems to me indefensible. I wrote 


Opinions may differ about the importance of Ramanujan’s work, the kind of 
standard by which it should be judged, and tbe influence which it is likely to 
have on the mathematics of the future. It has not the simplicity and the 
inevitableness of the very greatest work; it would be greater if it were less 
strange. One gift it shows which no one can deny, profound and invincible 
originality. He would probably have been a greater mathematician if he could 
have been caugbt and tamed a little in his youth; he would have discovered 
more that was new, and that, no doubt, of greater importance. On the other 
hand he would have been less of a Ramanujan, and more of a European pro- 
fessor, and the loss might have been greater than the gain... 


and I stand by that except for the last sentence, which is quite ridiculous 
sentimentalism. There was no gain at all when the College at Kumbakonam 
rejected the one great man they had ever possessed, and the loss was 
irreparable; it is the worst instance that I know of the damage that can be 
done by an inefficient and inelastic educational system. Solittle was wanted, 
£60 a year for five years, occasional contact with almost anyone who had 
real knowledge and a little imagination, for the world to have gained another 
of its greatest mathematicians. 

Ramanujan’s letters to me, which are reprinted in full in the Papers, 
contain the bare statements of about 120 theorems, mostly formal identities 
extracted from his notebooks. I quote fifteen which are fairly representa- 
tive. They include two theorems, (1.14) and (1.15), which are as interesting 
as any but of which one is false and the other, as stated, misleading. The 
rest have all been verified since by somebody; in particular Rogers and 
Watson found the proofs of the extremely difficult theorems (1.10)-(1.12). 


(1.1) capt Gap 

=(+atapt~) (ata 
(1.2) 1~9(5) +055 Y= 13(5 : 3) + =" 
(1.8) 1+9(3| + (5) + 20535) + = ary 
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(.8)| (aa ga = ra r+) O—a41) 
, 1+(3) } (a3) 
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(1.13) If F(k) = 1+(5) b+ (54) e-. and F(1~—k) = (210) F(®), 
then 
ke = (Y2—1)*(2—./8)? («7 —./8)* (8 ~3 4/7)? (10 — 3)4 
x (4—4/15)4 (./15 — 4/14)? (6 — /35)?. 
(1.14) The coefficient of x in (1—2x+2z4— 229+ ...)-! is the integer 
nearest to sinh mn 
Tan ). 


(1.15) The number of numbers between A and x which are either squares 
or sums of two squares is 
K i ieee 
an{log t) , 


where K = 0:764.,. and O(x) is very small compared with the previous 
integral. 
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Tshould like you to begin by trying to reconstruct the immediate reactions 
of an ordinary professional mathematician who receives a letter like this 
from an unknown Hindu clerk. 

The first question was whether I could recognisc anything. I had proved 
things rather like (1.7) myself, and seemed vaguely familiar with (1.8). 
Actually (1.8) is classical; it is a formula of Laplace first proved properly 
by Jacobi; and (1.9) occurs in a paper published by Rogers in 1907. I 
thought that, as an expert in definite integrals, I could probably prove (1.5) 
and (1.6), and did so, though with a good deal more trouble than I had 
expected. On the whole the integral formulae seemed the least impressive. 

The series formulae (1.1)-(1.4) I found much more intriguing, and it soon 
became obvious that Ramanujan must possess much more general theorems 
and was keeping a great deal up his sleeve. The second is a formula of Bauer 
well known in the theory of Legendre series, but the others are much harder 
than they look. The theorems required in proving them can all be found now 
in Bailey’s Cambridge Tract on hypergeometric functions. 

The formulae (1.10)-(1.13) are on a different level and obviously both 
difficult and deep. An expert in elliptic functions can see at once that (1.13) 
is derived somehow from the theory of “complex multiplication”, but 
(1.10)-(1.12) defeated me completely; I had never seen anything in the 
least like them before. A single look at them is enough to show that they 
could only be written down by a mathematician of the highest class. They 
must be true because, if they were not true, no one would have had the 
imagination to invent them. Finally (you must remember that I knew 
nothing whatever about Ramanujan, and had to think of every possibility), 
the writer must be completely honest, because great mathematicians are 
commoner than thieves or humbugs of such incredible skill. 

The last two formulae stand apart because they are not right and show 
Ramanujan’s limitations, but that does not prevent them from being 
additional evidence of his extraordinary powers. The function in (1.14) is 
a genuine approximation to the coefficient, though not at all so close as 
Ramanujan imagined, and Ramanujan’s false statement was one of the 
most fruitful he ever made, since it ended by leading us to all our joint work 
on partitions. Finally (1.15), though literally “true”, is definitely mis- 
leading (and Ramanujan was under a real misapprehension). The integral 
has no advantage, as an approximation, over the simpler function 

Ka 

(1.16) Tiles 3)’ 
found in 1908 by Landau. Ramanujan was deceived by a false analogy with 
the problem of the distribution of primes. I must postpone till later what 
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T have to say about Ramanujan’s work on this side of the theory of 
numbers. 

Tt was inevitable that a very large part of Ramanujan’s work should 
prove on examination to have been anticipated. He had been carrying an 
impossible handicap, a poor and solitary Hindu pitting his brains against 
the accumulated wisdom of Europe. He had had no real teaching at all; 
there was no onc in India from whom he had anything to learn. Ho can have 
seen at the outside three or four books of good quality, all of them English. 
There had been periods in his life when he had access to the library in Madras, 
but it was not a very good one; it contained very few Fronch or Gorman 
books; and in any case Ramanujan did not know a word of either language. 
T should estimate that about two-thirds of Ramanujan’s best Indian work 
was rediscovery, and comparatively little of it was published in his litetime, 
though Watson, who has worked systematically through his notebooks, 
has since disinterred a good deal more. 

The great bulk of Ramanujan’s published work was done in England. 
His mind had hardened to some extent, and le never became at all an 
“orthodox”? mathematician, but he could still learn to do new things, and 
do them extremely well. It was impossible to teach him systematically, but 
he gradually absorbed new points of view. In particular he learnt what was 
meant by proof, and his later papers, while in some ways as odd and in- 
dividual as ever, read like the works of a well-informed mathematician. 
His methods and his weapons, however, remained essentially the same. One 
would have thought that such a formalist as Ramanujan would have 
revelled in Cauchy’s Theorem, but he practically never used it,t and the 
most astonishing testimony to his formal genius is that he never seemed 
to feel the want of it in the least. 

Tt is easy to compile an imposing list of theorems which Ramanujan 
rediscovered, Such a list naturally cannot be quite sharp, since sometimes 
he found a part only of a theorem, and sometimes, though he found the 
whole theorem, he was without the proof which is essential if the theorem is 
to be properly understood. For example, in the analytic theory of numbers 
he had, in a sense, discovered a great deal, but he was a very long way from 
understanding the real difficulties of the subject. And there is some of his 
work, mostly in the theory of elliptic functions, about which some mystery 
still remains; it is not possible, after all the work of Watson and Mordell, to 
draw the line between what he may have picked up somehow and what he 


must have found for himself. I will take only cases in which the evidence 
seems to me tolerably clear. 


* Perhaps never. There is a reference to “the theory of residues” on p. 129 3f the 
Papers, but I believe that I supplied this myself. 
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Here I must admit that I am to blame, since there is a good deal which 
we should like to know now and which I could have discovered quite easily. 
I saw Ramanujan almost every day, and could have cleared up most of the 
obscurity by a little cross-examination. Ramanujan was quite able and 
willing to give a straight answer to a question, and not in the least disposed 
to make a mystery of his achievements. I hardly asked him a single question 
of this kind; I never even asked him whether (as I think he must have done) 
he had seen Caylcey’s or Greenhill’s Elliptic functions. 

I am sorry about this now, but it does not really matter very much, and 
it was entirely natural. In the first place, I did not know that Ramanujan 
was going to die. He was not particularly interested in his own history or 
psychology; he was a mathematician anxious to get on with the job. And 
after all I too was a mathematician, and a mathematician meeting Rama- 
nujan had more interesting things to think about than historical research. 
It seemed ridiculous to worry him about how he had found this or that known 
theorem. when he was showing me half a dozen new ones almost cvery day. 

I do not think that Ramanujan discovered much in the classical theory of 
numbers, or indeed that he ever knew a great deal. He had no knowledge 
at all, at any time, of the general theory of arithmetical forms. I doubt 
whether he knew the law of quadratic reciprocity before he came here. 
Diophantine equations should have suited him, but he did comparatively 
little with them, and what he did do was not his best. Thus he gave solutions 
of Euler’s equation 


(1.17) B+ pPta = wi, 
such as 
(1.18) (f = 3a?+ 5ab—5b?,  y = 4a2—4ab+ 66%, 
‘ 2 = 5a®—5ab—3b2, w = 6a?~4ab+ 4b; 
and 


% = m7 —3m4(14+p)+ m(2 + 6p + 3p”), 
(1.19) y = 2m8— 3m3(1 + 29) +14 3p + 3p?, 
z=m&—1—3p—3p?, w= m7 —3m'p+m(3p?— 1); 
but neither of these is the general solution. 
He rediscovered the famous theorem of von Staudt about the Bernoullian 


numbers: 
11 1 


(1.20) (“IP B= G45 tote te, 
where 9, g, ... are those odd primes such that p—1,q—1, ... are divisors of 
2n, and G, is an integer. In what sense he had proved it it is diffieult to say, 
since he found it at a time of his life when he had hardly formed any definite 
concept of proof. As Littlewood says, “the clear-cut idea of what is meant 


€ 
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by a proof, nowadays so familiar as to be taken for granted, he perhaps did 
not possess at all; if a significant piece of reasoning occurred somewhere, 
and the total mixture of evidence and intuition gave him certainty, he 
looked no further”. I shall have something to say later about this question 
of proof, but I postpone it to another context in which it is much more 
important. In this case there is nothing in the proof that was not obviously 
within Ramanujan’s powers. 

There is a considerable chapter of the theory of numbers, in particular 
the theory of the representation of integers by sums of squares, which is 
closely bound up with the theory of elliptic functions. Thus the number of 
representations of n by two squares is 

(1.21) r(n) = 4{d,(n) —ds(n)}, 
where d,(n) is the number of divisors of n of the form 44+ 1 and d,(n) the 
number of divisors of the form 44+ 3. Jacobi gave similar formulae for 
4, 6 and 8 squares. Ramanujan found all these, and much more of the 
same kind. 

He also found Legendre’s theorem that is the sum of 3 squares except 
when it is of the form 


(1.22) 49(8k+-7), 


but I do not attach much-importance to this. The theorem is quite easy to 
guess and difficult to prove. All known proofs depend upon the general 
theory of ternary forms, of which Ramanujan knew nothing, and I agree 
with Professor Dickson in thinking it very unlikely that he possessed one. 
In any case he knew nothing about the number of representations. 

Ramanujan, then, before he came to England, had added comparatively 
little to the theory of numbers; but no one can understand him who does 
_ not understand his passion for numbers in themselves. I wrote before 


He could remember the idiosyncrasies of numbers in an almost uncanny way. 
It was Littlewood who said that every positive integer was one of Ramanujan’s 
personal friends. I remember going to see him once when he was lying ill in 
Putney. I had ridden in taxi-eab No. 1729, and remarked that the number 
seemed to me rather a dull onc, and that I boped that it was not an unfavourable 
omen. “No,” he replied, “‘it is a very interesting number; it is the smallest 
number expressible as a sum of two eubes in two different ways.” T asked him, 
naturally, whether he could tell me the solution of the corresponding problem 
for fourth powers; and he replied, after a moment’s thought, that he knew no 
obvious example, and supposed that the first such number must be very large. 


In algebra, Ramanujan’s main work was concerned with hypergeometric 
series and continued fractions (I use the word algebra, of course, in its old- 


~  * 1729 = 128-415 = 1084.93, 
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fashioned sense). These subjects suited him exactly, and here he was un- 
questionably one of the great masters. There are three now famousidentities, 
the “Dougall-Ramanujan identity” 


20 gs (a-+y+e2+ut 26+ 1) Lin) 
a La 4 ~ 
(1.23) Pn 1)" (s+ 2n) 1) (atyt2+utela apaule+s+1)™ 
8 - T(e+st ly Pyteturst)) 
“Tie+l)Tlutytzt+utst ) ayen T(2+u+s+l) ‘ 


whee, a = a(a+1)...(a+n—-1), @=a(a~1)...(a—n+]), 


and the “ Rogers-Ramanujan identities” 


1424 a cis a 
"I=g (-gQ-@) ' G-9)0-@)U-#) 
1 

~(-g0-@)... 0-1-9)...’ 
(1.24) ‘ ‘ 

i poet eee ee + 
l—q¢' (l-@)(1-q@) (1-9) (1-#) (1-9) 
1 


© (L=@) (1-q")... 1-9) (1-9@)...’ 


in which he had been anticipated by British mathematicians, and about 
which I shall speak in other lectures.’ As regards hypergeometric series one 
may say, roughly, that he rediscovered the formal theory, set out in Bailey’s 
tract, agit was known up to 1920. There is something about it in Carr, and 
more in Chrystal’s Algebra, and no doubt he got his start from that. The 
four formulae (1.1)—(1.4) are highly specialised examples of this work. 

His masterpiece in continued fractions was his work on 


je 

1+ 14+ 14...’ 

which includes the theorems (1.10)-(1.12). The theory of this fraction 

depends upon the Rogers-Ramanujan identities, in which he had been 

anticipated by Rogers, but he had gone beyond Rogers in other ways and 

the theorems which I have quoted are his own. He had many other very 

general and very beautiful formulae, of which formulae like Laguerre’s 
(e+1)"—(e-1)? mn 2-1 n?-2? 


(1.28) (otl*+(e—-1lp 2+ 8e+ a+... 


(1.28) 


are extremely special cases. Watson has recently published a proof of the 
most imposing of them. 


* Sce Lectures VI and VII. 
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It is perhaps in his work in these fields that Ramanujan shows at his very 
best. I wrote 

It was his insight into algebraical formulae, transformation of infinite series, 
and so forth, that was most amazing. On this side most certainly I have never 
met his equal, and I can compare him only with Euler or Jacobi. He worked, 
far more than the majority of modern mathematicians, by induction from 
numerieal examples; all his congruenee properties of partitions, for example, 
were discovered in this way. But with his memory, his patience, and his power 
of calculation he combined a power of generalisation, a fecling for form, and 
a capacity for rapid modification of his hypotheses, that were often really 
startling, and made him, in his own peculiar field, without a rival in his day. 


I do not think now that this extremely strong language is extravagant. 
It is possible that the great days of formulae are finished, and that Rama- 
nujan ought to have been born 100 years ago; but he was by far the greatest 
formalist of his time. There have been a good many more important, and I 
suppose one must say greater, mathematicians than Ramanujan during 
the last fifty years, but not one who could stand up to him on his own ground. 
Playing the game of which he knew the rules, he could give any mathe- 
matician in the world fifteen. 

In analysis proper Ramanujan’s work is inevitably less impressive, since 
he knew no theory of functions, and you cannot do real analysis without it, 
and since the formal side of the integral calculus, which was all that he could 
learn from Carr or any other book, has been worked over so repeatedly and 
so intensively. Still, Ramanujan rediscovered an astonishing number of 
the most beautiful analytic identities. Thus the functional equation for the 
Riemann Zeta-function 


cehaes f 
&s) = a ns? 
namely 
(1.27) &(1 —s) = 2(2m)- cos den I'(s) f(s), 


stands (in an almost unrecognisable notation) in the notebooks. So does 
Poisson’s summation formula 


(1.28)  ot{$B(0)+ P(e) + A( Zar) +...} = BH GY(0) + WP) + W(2h)+...}, 
where ve) = ‘| (;) ie P(t) cos at dt 


and «8 = 27; and so also does Abel’s functional equation 


(1.29) L(x) + L(y) + L(y) + oe ' $ 1 | = 8L(1) 


eo EP 8 
for L@)= ptt gt.. 
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He had most of the formal ideas which underlie the recent work of Watson 
and of Titchmarsh and myself on “Fourier kernels” and “reciprocal 
functions”; and he could of course evaluate any evaluable definite integral. 
There is one particularly interesting formula, viz. 


ng(—s) 


sins7 ’ 


(1.30) [, {6(0) ~ 261) + 2%9(2)—...}de = 


of which he was especially fond and made continual use. This is really an 
“interpolation formula’’, which enables us to say, for example, that, under 
certain conditions, a function which vanishes for all positive integral values 
of its argument must vanish identically. I have never seen this formula 
stated explicitly by anyone elsc, though it is closely connected with the 
work of Mellin and others. 

T have left till last the two most intriguing sides of Ramanujan’s early 
work, his work on elliptic functions and in the analytic theory of numbers. 
The first is probably too specialised and intricate for anyone but an expert 
to understand, and I shall say nothing about it now.* The second subject 
js still more difficult (as anyone who has read Landau’s book on primes or 
Ingham’s tract will know), but anyone can understand roughly what the 
problems of the subject are, and any decent mathematician can understand 
roughly why they defeated Ramanujan. For this was Ramanujan’s one 
real failure; he showed, as always, astonishing imaginative power, but he 
proved next to nothing, and a great deal even of what he imagined was 
false. 

Here I am obliged to interpolate some remarks on a very difficult subject: 
proof and its importance in mathematics. All physicists, and a good many 
quite respectable mathematicians, are contemptuous about proof. I have 
heard Professor Eddington, for example, maintain that proof, as pure 
mathematicians understandit, isreally quite uninteresting and unimportant, 
and that no one who is really certain that he has found something good 
should waste his time looking for a proof. It is true that Eddington is in- 
consistent, and has sometimes even descended to proof himself. It is not 
enough for him to have direct knowledge that there are exactly 


136, 2256 


protons in the universe; he cannot resist the temptation of proving it; and 
I cannot help thinking that the proof, whatever it may be worth, gives him 
a certain amount of intellectual satisfaction. His apology would no doubt 
be that “proof”? means something quite different for him from what it 
means for a pure mathematician, and in any case we need not take him too 


® See Lecture XII, 
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literally. But the opinion which L have attributed to him, and with which 
Tam sure that almost all physicists agree at the bottom of their hearts, is 
one to which a mathematician ought to have some reply. 

T am not going to gct entangled in the analysis of a particularly prickly 
concept, but I think that there are a few points about proof where nearly 
all mathematicians are agreed. In the first place, even if we do not under- 
stand exactly what proof is, we can, in ordinary analysis at any rate, re- 
cognise a proof when we see one. Secondly, there are two different motives 
in any presentation of a proof. The first motive is simply to secure con- 
viction. The second is to exhibit the conclusion as the climax of a con- 
ventional pattern of propositions, a sequence of propositions whose truth 
is admitted and which are arranged in accordance with rules. These are the 
two ideals, and experience shows that, except in the simplest mathematics, 
we can hardly ever satisfy the first ideal without also satisfying the second. 
We may be able to recognise directly that 5, or even 17, isprime, but nobody 
can convince himself that 9127} 


is prime except by studying a proof. No one has ever had an imagination 
so vivid and comprehensive as that. 

A mathematician usually discovers a theorem by an effort of intuition; 
the conclusion strikes him as plausible, and he sets to work to manufacture 
a proof. Sometimes this is a matter of routine, and any well-trained pro- 
fessional could supply what is wanted, but more often imagination is a very 
unreliable guide. In particular this is so in the analytic theory of numbers, 
where even Ramanujan’s imagination led him very seriously astray. 

There is a striking example, which I have very often quoted, of a false 
conjecture which seems to have been endorsed even by Gauss and which 
took about 100 years to refute. The central problem of the analytic theory 
of numbers is that of the distribution of the primes. The number 7(x) of 
primes less than a large nwmber x is approximately 


xz 


(1.31) ine 


this is the “Prime Number Theorem”, which had been conjectured for a 
very long time, but was never established properly until Hadamard and 
de la Vallée-Poussin proved it in 1896. The approximation errs by defect, 
and a much better one is 


(1.32) liz = 
In some ways a still better one is 
(1.88) lie—}ligt—diat—Lliat+Miat—Lliat+... 
* The integral is a ‘principal value’. See § 2.2. 
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(we need not trouble now about the law of formation of the series). It is 
extremely natural to infer that 

(1.34) a(x) <liz, 
at any rate for large x, and Gauss and other mathematicians commented 
on the high probability of this conjecture. The conjecture is not only plaus- 
ible but is supported by all the evidence of the facts. The primes are known 
up to 10,000,000, and their number at intervals up to 1,000,000,000, and 
(1.34) is true for every value of x for which data exist. 

In 1912 Littlewood proved that the conjecture is false, and that there 
are an infinity of values of # for which the sign of inequality in (1.34) must 
be reversed. In particular, there is a number X such that (1.34) is false for 
some # less than X. Littlewood proved the existence of X, but his method 
did not give any particular value, and it is only very recently that an 
admissible value, viz. 

X = 100", 
was found by Skewes. I think that this is the largest number which has 
ever served any definite purpose in mathematics. 

The number of protons in the universe is about 

108°, 
The number of possible games of chess is much larger, perhaps 
1910" 


(in any case a second-order exponential). If the universe were the chess- 
board, the protons the chessmen, and any interchange in the position of 
two protons a move, then the number of possible games would be something 
like the Skewes number. However much the number may be reduced by 
refinements on Skewes’s argument, it docs not seem at all likely that we 
shall ever know a single instance of the truth of Littlewood’s theorem. 

This is an example in which the truth has defeated not only all the 
evidence of the facts and of common sense but even a mathematical imagina- 
tion so powerful and profound as that of Gauss; but of course it is taken 
from the most difficult parts of the theory. No part of the theory of primes 
is really easy, but up to a point simple arguments, although they will prove 
very little, do not actually mislead us. For example, there are simple 
arguments which might lead any good mathematician to the conclusion 

: x 

of the Prime Number Theorem,’ or, what is the same thing, to the conclusion 


? f(z)~g(x) means that the ratio f/g tends to unity. 
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that 
(1.36) Py~ nlogn, 


where p, is the n-th prime number. 
In the first place, we may start from Euler’s identity 


ve — EA. ae ae sak 
(1.87) Ia pes het ptt De 


(1-24) (1-3) (1-5). 


This is true for s>1, but both series and product become infinito for s = 1. 
Tt is natural to argue that, when s = 1, the series and the product should 
diverge in the same sort of way. Also 


1 1 1 i 1 
(1.38) log iT; aa Slog =e = 25+2(sat Spe ) ; 


and the last series remains finite for s = 1. It is natural to infer that 


gt 
P 
diverges like log (z ;| ; 
or, more precisely, that 
(1.39) x me D3 ale ~loglogx 
psa P 
for large z. Since also 
2 alonn logan ~log log x, 


formula (1.39) indicates that p, is about nlogn. 
There is a slightly more sophisticated argument which is really simpler. 
It is easy to see that the highest power of a prime p which divides 2! is 


l-Leb Ta} 


where [y] denotes the integral part of y. Hence 


= [I pipitiaipe... 
pSe 


(1.40) log x! = = ([Z]+[S]+-)toee. 


The left-hand side of (1.40) is practically slog x, by Stirling’s Theorem. As 
regards the right-hand side, one may argue; squares, cubes, ... of primes are 
comparatively rare, and the terms involving them should be unimportant, 
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and it should also make comparatively little difference if wo replace [x/p] 
by /p. We thus infer that 


zy MOB Plow x, > 
psc P pSr 
and this again just fits the view that p, is approximately nlogn. 

This is broadly the argument used, naturally in a less naive form, by 
Tchebychef, who was the first to make substantial progress in the theory of 
primes, and I imagine that Ramanujan began by arguing in the samo sort 
of way, though there is nothing in the notebooks to show. All that is plain 
is that Ramanujan found the form of the Primo Number Theorom for 
himself. This was a considerable achievement; for the nen who had found 
the form of the theorem before him, like Legendre, Gauss, and Dirichlet, 
had all been very great mathematicians; and Ramanujan found other 
formulae which lie still further below the surface. Perhaps tho best instance 
is (1.15). The integral is better replaced by the simpler function (1.16), but 
what Ramanujan says is correct as it stands and was proved by Landau in 
1909; and there is nothing obvious to suggest its truth. 

The fact remains that hardly any of Ramanujan’s work in this field had 
any permanent value. The analytic theory of numbers is one of those 
exceptional branches of mathematics in which proof really is everything 
and nothing short of absolute rigour counts. The achievement of the 
mathematicians who found the Prime Number Theorem was quite a small 
thing compared with that of those who found the proof. It is not merely 
that in this theory (as Littlewood’s theorem shows) you can never be sure 
of,the facts without the proof, though this is important enough. The whole 
history of the Prime Number Theorem, and the other big theorems of the 
subject, shows that you eannot reach any real understanding of the structure 
and meaning of the theory, or have any sound instincts to guide you in 
" farther research, until you have mastered the proofs. It is comparatively 
easy to make clever guesses; indeed there are theorems, like “Goldbach’s 
Theorem”’,t which have never been proved and which any fool could have 
guessed. 

The theory of primes depends upon the properties of Riemann’s function 
(s), considered as an analytic function of the complex variable s, and in 
particular on the distribution of its zeros; and Ramanujan knew nothing 
at all about the theory of analytic functions. I wrote before 


log p 
=+ ~log x, 
DP od 


Ramanujan’s theory of primes was vitiated by his ignorance of the theory of 
functions of a complex variable. It was (so to say) what the theory might be if 
the Zeta-fanction had no complex zeros. His method depended upon a wholesale 


* “Any even number greater than 2 is the sum of two primes.” 
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use of divergent series....That his proofs sliould have been invalid was only te 
be expected. But the mistakes went deeper than that, and many of the actual 
results were false. He had obtained the dominant terms of the elassiea] formulae, 
although by invalid methods; but none of them are sueh elose approximations 


as he supposed. : 
This may be said to have been Ramanujan’s one great failure... 


and if I had stopped there I should have had nothing to add, but I allowed , 
myself again to be led away by sentimentalisin. I went on to argue that 
‘this failure was more wonderful than any of is triumply”, and that is an 
absurd exaggoration. It is no use trying to pretend that failure is someting 
else. This much perhaps we nay say, that his failure is ono whieh, on the 
balanee, should increase and not diminish our admiration for his gifts, since 
it gives us additional, and surprising, evidence of his imagination and 
versatility. 

But the reputation of a mathematician eannot be made by failures or by 
rediseoveries; it must rest primarily, and rightly, on aetual and original 
achievement. I have to justify Ramanujan on this ground, and that I hope 
to do in my later leetures. 


NOTES ON LECTURE I 


p, 1. This lecture is a reprint of ono delivered at the Harvard Toreontonary 
Conferenee of Arts and Scionces ou August 31, 1936, and publishod in tho American 
Math. Monthly, 44 (1937), 187-155. 

pp. 7-8. For (1.1), see Preece (2); for (1.2) and (1.3), Hardy (4,8); for (1.4), Hardy(4), 
Whipple (1) and Watson (7). All these formulae are spocial cases of much more general 
formulao discussed in Bailey’s Cambridge Traet Generalised hypergeometric series 
(no. 32, 1935). See alse Lecture VII. 

For (1.5) and (1.6), see no. 11 of the Papers, and Hardy (5, 6). 

There are formulae of the same type as (1.7) in a paper by Hardy in the Quarterly 
Journal of Math. 35 (1904), 198-207. The formula (1.7) itself is proved by Preece (2). 
See also no. 11 of the Papers. 

For (1.8), ses Watson (2); for (1.9), Preece (4); for (1.10}-(1.12), Watson (4, 6); 
and for (1.13), Watson (8). 

As regards (1.15), see Landau, Archiv der Math. und Physik (3), 13 (1908), 305-315, 
and Stanley (1). Miss Stanley shows just how Ramanujan’s statemont is misleading. 
See also Lecture IV (B). 

p. 10. The Librarian of the University of Madras has very kindly sent me a copy of 
the Catalogue published in 1914, which makes it plain that the library was better 
equipped than I had supposed. For example it possessed two standard Freneh 
treatises on elliptic functions (Appell and Laeour, Tannory and Molk) as well as the 
‘books of Cayley and Greenhill. It seems plain from other evidonco that Ramanujan 
know something of the English books but nothing of the French ones. 

p. 11, Euler found the general rational solution of (1.17), and his solution was 
afterwards simplified by Binot and other writers. See, for example, Hardy and 
Wright, 198-202. A number of special solutions similar to (1.18) will be found in 


x 
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Dickson’s History, ii, 500 et seg. The solution (1.19) is substantially the same as one 
found by J. R. Young (Dickson, History, ii, 554). 
The simplest known solution of 
vipat a 2t+ti 
is Euler’s 1584+ 594 == 1341+ 1334 = 635318657. 


See Dickson, Inéroduciion, 60-62, and History, ii, 644~647. Euler gave a solution 
involving two parameters, but no ‘general’ solution is known. 

Thero is a proof of von Staudt’s theorem, due to R. Rado, in Hardy and Wright, 
89-92. 

The quotation from Littlewood is from his review of tho Papers. 

p. 12, The general theory of the representation of numbers as sums of an even 
number of squares is discussed in Lecturo IX. For Legendre’s ‘three square’ 
theorem see Landau, Vorlesungen, i, 114-122. 

p. 13. Laguerro’s formula (1.26) is formula (18) of Ch. xi of the ‘second edition’ 
of Ramanujan’s notebooks. See Watson (10), 146. 

For ‘the most imposmg’ formula see Watson (14). 

p. 14. The equation (1.29) was rediscovered by Rogers, Proc. London Math. Soc. 
(2), 4 (1907), 169-189, and is attributed to him in the Papers, 337; but it is to be 
found in a posthumous fragment of Abol (Huvres, ii, 193). 

p. 15. For (1.30) see Lecturo XI. 

p. 17. Skewes, Journal London Math. Soc. 8 (1933), 277-283. Skewes assumes 
the truth of the Riemann Hypothesis, but he has since found a (much larger) value 
for X independent of the hypothesis. This work is still unpublished, 

p. 18. For (1.40) see, for example, Hardy and Wright, 342; Ingham, 20; Landau, 
Handbuch, 75-76. 


II 


RAMANUJAN AND THE THEORY 
OF PRIME NUMBERS 


2.14. I shall begin by discussing Ramanujan’s work in tho ‘‘analytic” 
theory of numbers, and particularly on its most famous problem, the pro- 
blem of the distribution of the primes. I told you that his work in this field 
has little permanent value, but I am not afraid that you will find what I 
have to say the less interesting for that. The problem is one of the most 
fascinating in the whole of mathematics, and Ramanujan’s attack on it 
was conceived in a thoroughly interesting way; and I have unpublished 
manuscripts to refer to, and can explain to you for the first time just where 
and how he failed. 

Ramanujan wrote about the problem in both of his first two letters. He 
did not write at length, and I can quote what he says in full. 


16 Jan. 1913 
In p. 367 it is stated that ‘the mumber of prime numbers less than 2 is 


2 dt f 
e loge ee) 


where the precise order of p(x) has not been determined”’. 

I. I have found a function which exactly represents the number of 
prime numbers less than x, “exactly” in the sense that the difference be- 
tween the function and the actual number of primes is generally 0 or some 
small finite value even when x becomes infinite. I have got the function in 
the form of infinite series and have expressed it in two ways. 

(1) In terms of Bernoullian numbers. From this we can easily calculate 
the number of prime numbers up to 100 millions, with generally no error 
and in some cases with an error of 1 or 2, 

(2) As a definite integral from which we can calculato for all values. 

T have observed that p(e?”*) is of such a nature that its value is very smal} 
when lies between 0 and 8 (its value is less than a few hundreds when 
“= 3) and rapidly increases when 2 is greater than 3. 

Il. I have also got expressions to find the actual number of prime 
numbers of the form d4n+ B. 


f + 
* Of my Cambridge Track Orders of infinity (no. 12, ed. 2, 1924). 


* I have altered the sign of the second term so as to agree with Ramanujan’s later 
assertions, 
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The difference between the number of prime numbers of the form 4n—1 
and which are less than «, and those of the form 4n-+ 1 less than «, is infinite 
when « becomes infinite. ... 


29 Feb. 1913 
1. The number of prime numbers less than + is 
2 y d 
AA POTTS Re ae 
ely iP er 1) es 1) 
where 
1 1 
(2.1.2) St+1)=imtgat-. 
and y = log x. 


2. The number of prime numbers less than z is 


2/2 /loga 4 floga\® 6 /logz\5 
ey) = {ze ( Qa )+s5( 27) +55; ( 2a ) tof 


where B, = 3, B, = 4, ..., the Bernoullian numbers, 


3. The number of prime numbers less than « is 


nia | eee af de lpr dt pee dé 
Ge) elogt 2), logt 3), logt 5), logt 6J, logt “” 


wrere ¢ = 1-45136380 nearly ....... ‘ 
1 have also found expressions for the number of prime numbers of a given 
form (say of the form 247 + 17) less than any given number. 


Primes of the form 4n+ 1 = primes of the form 6éx+1, 
6n—1,° 
Pedise Santee cutvesevendess l2n+1. 


Those of the forms 8%+8, 8%+5, 8n+7, 12n+5, 12n+7, and 12n+11 
are all equal. 


But 
(primes of the form 4n—1)—(those of the form 42+1)+0, 
(sigheowmsaeete neers GD) = (sasanscnveiasiens vanes 6n+1)> 0, 
(Eee ae Sp ae Caen 8n+1)>0, 
(Getennrteanr eres V2 5) S Ca cecisdesveveencaave 12% +1) 00. 


* Ramanujan proceeds to explain the law of formation of the series. It is of course 
elim 
(2.1.8) phn) { oa 
m Je logt 

where y(m) is the Mébius function which is (—1)? when m is “‘quadratfrei”’ (a 
product of p different primes) and 0 otherwise. 

He continues with instructions concerning computation from the series (see 
Papers, 351). 
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I have not merely shown that the difference tends to infinity, but found 
out expressions (like those for prime numbers) for tho difference, within 
any given number.......... 


We must read tho seeond letter in the light of the first; the formulae 
(2.1.1), (2.1.8) and (2.1.4) are naturally not exact.t Ramanujan defines 
three funetions, the integral (2.1.1), which L will call /(0), and the two 
sories (2.1.3) and (2.1.4), which L will call (@(*) and A(x) respectively, and 
asserts that they difler boundedly from a(a): if F(r) is any one of the three 
functions, then 

(2.1.6) me) = For) + O(1). 

The series R(x) is a famous series which occurs in Riemann’s work, aud @ 
series much like G(x) was found by Gram. The integral J(2) had never, so 
far as I know, appeared before; and in any case Lam sure (for reasons which 
I will state later) that Ramanujan had found all three functions lor himself. 


Ramanujan’s series and integral 
2.2. The series used by Gram is not G(x) bnt 
(log we)!" 
2.2.1 = 1 a 
ia) aw) = Dek Sie 
and G(x) is twice tho sum of its odd terms.? This scrics can (as no doubt 
Ramanujan knew) be shown to be identical with R(x). 


Wo require some preliminary observations about A(x), which I will 
write in the form (2.1.5).3 It is known that 


(2.2.2) ohm 6 
and 
(2.2.3) oe oh 
md 


(though even the first of these equations is as “deep” as the Prime Number 
Theorem). It follows from (2.2.2) that the value of ¢ in (2.1.4) is immatorial 
{so long as it is not 1). In particular, if we define the logarithm integral 


liz, when #>1, by 
—e 
oe | logt =tm({) oy dines 


* Since a(x) is discontinuous. 
QR—Lgy BR 
? Since aes 
(2k) = Bay 


(I follow Ramanujan’s notation for the Bernoullian numbors, writing By, where 
2B, is more usual.) 


3 See the footnote to p. 23. 
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(a “principal value” in Cauchy’s sense), then we may write (2.1.5) in the 
more usual form 
(2.2.4) Ree we asim, 
It is known that 


F & (log x)" 
(2.2.5) liz = 7+ logloga: + oa 


where y is Euler’s constant. Hence 


1 
(2.2.6) lia’ = y +log log a — logm+ 5 oe 


= —logm+y-+loglogx+ f=) 


when m-—>oo, and the convergence of (2.2.4) depends upon that of (2.2.2) 
and (2.2.3). 

If we substitute from (2.2.6) into R(x), and remember the formulae 
(2.2.2) and (2.2.3), we obtain 


Ra) = (y+ loglogz) FA) — 9 hog m+ 5 Hy, SOBA 
loge)" uye pln )_ (log x)" ‘ 
a1 eat Bae + Zager rea 


so that the sums of eeaee s and Gram’s series are the same. Also, if we 
write loga = y and 


g(t) = Mlogz) = My) = 14+ EA a, 

then G(x) = h(y)-h—-y), 
and it is not difficult to show that 

(2.2.7) h-y)+0 
when y> oo. Hence G(x) = R(z)+0(1), 
and Ramanujan’s series is equivalent to Gram’s. 

We can also prove that 

(2.2.8) I(x) = G(x) +0(1); 
but the proof, which ey upon the formula 

qtti spe -{° ena dy 
o Tet n? afm + (log a)?}’ 


is a little more seat If we may take this for granted, we can conclude 
that Ramanujan’s three approximations are equivalent. I may therefore 
confine myself, in what follows, to the most familiar function, R(2). 


All summations are from 1 to 0. 
> See the notes at the end of the lecture. 
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The series R(x) 


2.3. Ramanujan’s assertions amount to this, that 


(2.3.1) n(x) — R(a) = O(1), 
ie. that a(x) — R(x) is bounded. In this case 
(2.3.2) (a) — R(x) = O(a) 


for every positive 6, so that the orror is less important than any term of 
R(x). It would be an easy deduction that 


(2.3.3) a(a) = lin + O(ak?), 
(2.3.4) a(z) = lin—4dliat+ O(a), 

and so on, for every positive 6. In particular it would follow that 
(2.3.5) n(x) —liz—>—oo 


when «+00. I may as well say at once that, of these assertions, (2.3.1), 
(2.3.2), (2.8.4) and (2.3.5) are certainly false, while (2.3.3) stands or falls 
with the Riemann Hypothesis. 

Ramanujan’s main thesis is false, but it is astonishing how well it fits the 


facts. The primes are tabulated up to 10,000,000: and the valuo of m(z) has 
been found for a number of much larger values of x. The table shows some 
of the values of 7(x), and the errors of liz and R(x) by excess or default. 

x a(x) lia B(x) 

100,000 9,692 + 38 -— 6 

1,000,000 78,498 + 130 +30 

2,000,000 148,933 + 122 - 9 

3,000,000 216,816 + 155 0 

4,000,000 283,146 + 206 +33 

5,000,000 348,513 + 125 —64 

6,000,000 412,849 + 228 +24 

7,000,000 476,648 + 179 —38 

8,000,000 539,777 + 223 -— 6 

9,000,000 602,489 + 187 —53 

10,000,000 664,579 + 3839 +88 

100,000,000 8,761,455 +755 |‘ +97 

1,000,000,000 50,847,478 +1758 —23 


The largest error known in R(x) is +228 for a = 90,000,000. The agree- 
ment is very striking and (though Ramanujan had not facts on this scale 
to mislead him’) it is not at all surprising that he should have been misled. 


* He quotes only results for quite small #; errors —0-1, ~0+1, +0-2 for a = 50, 300, 
1000 (Papers, 351). 
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The early history of the theory of primes 


2.4. Ramanujan, of course, had not merely “guessed’’ his theorems; no 
flight of pure imagination could carry any man so far. He had a “proof”, 
a definite, and very ingenious, train of reasoning. This I intend to explain, 
but it is essential that I should give first a rapid sketch of the history and 
structure of the “classical” theory. 


The Prime Number Theorem, 
x 

(2.4.1) m(@) ~ioo a? 
was conjectured independently hy Legendre and hy Gauss. Gauss gave 
the approximation liz, now known to he more accurate, but neither author 
is very explicit about the degree of accuracy which they claimed for their 
formulae. As I said in my first lecture, (2.4.1) is equivalent to p, ~” log n. 

The first definite progress was made hy Tchehychef. Tchehychef proved 


that 


(2.4.2) Cx Cx 


log x anys log x’ 


or (what is the same thing) that 
(2.4.3) Cnlogn<p, < Cnlogn. 
He also introduced the functions 


(2.4.4) (x)= Dlogp, w(x)= % logp? 
psx psa 


These functions are in some ways more natural than 7(x). Thus 

Hx) = log ( TI 2) 
Sa 

(and the most natural operation to perform on a set of primes is to multiply 

them); and ¢(x) is the logarithm of the least common multiple of the num- 

hers up to x. It is the more complex function (x), as we shall see, which 

presents itself most naturally in the analytic theory. The number of 

squares, cuhes, ... of primes up to x does not exceed 


attat+at+... = O(atlog x),3 


* Luse C generally for an ‘‘absolute constant”’ (a number such as 7 or 7). The various 
C’s are naturally not equal. 
7 Count logp for every p, p®, ... up to x. Thus 


(10) = log 2+log3 + log 2+log 5+log 7 +log 2+log 3 
= Slog 2+ 2log 3+log5+log7 
(log 2, for example, arising from 2, 2% = 4, 28 = 8). 
3 Since p™ >a if m> log az/log 2, we need take only O(log x) terms of the series. 


. 
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which is insignificant compared with funetions whose order is about z. 
Hence powers of primes are eomparatively unimportant in the theory, and 
we may think of Ha) and y(q) as, for our present purpose, substantially 
the same. 

It is natural to expeet that 


(2.4.5) log &.1(2) ~ Xx) ~ (x) 
since (i) x) and y(x) are “‘much the same”, and (ii) (x) contains a(x) 


terms in most of which logy is nearly logz. Tchebychef gave an accurate 
proof of (2.4.5), and inferred that 


log a. m(x) 
& 


(2.4.6) lim = jae = lim ee) 
x x 


if any one of the three limits exists, and that the Prime Number Theorem is 
equivalent to either of 
(2.4.7) KHa)~z, (r)~a. 


Finally, he proved that the only possible value of the limits is 1; but he 
could not overcome the fundamental difficulty, the proof of the existence 
of the limits. 


The proof of the Prime Number Theorem 


2.5. The Prime Number Theorem was finally proved by Hadamard and 
de la Vallée-Poussin in 1896. The analytic attack upon the problem depends 
on Huler’s identity 


2.5.1 284 
(2.5.1) le) = Lin Nias i 


where Sts > 1. It follows from (2.5.1) that 


: 1 1 
log &(s) = Slo: ee 
oe 2 Sip 2 mpi 


the summation being over all primes 7 and all positive integers m. Hence 


: log 
(2.5.2) 8) _ 5 logp _ =i 
&(s) a ps Bann, 
where a, = logp (n= 6), 
a, = 0 (otherwise). 


Bere @,, is the arithmetieal function usually denoted by A(n), and the 
“sum Fanetioni ee 
A(z) = ay 


of a, is (ax). a 
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We now require a general theorem in the theory of Dirichlct’s series. 
Suppose that s = +1, and that 
J(s) = Xa, n-§ 
is absolutely convergent for o > 1. Then 


1 ertd 7/8 
—— [ Yas 
27 J e-ico & 
is 0, 3,1 
in the three cases 0<y <1, y = 1, y>1;' and it is easily proved that 
I as 4 &\8ds 8, 

sp [Mas = Ba, (TF = Nan = AN 
where A*(x) differs from A(x) only in that, when x is an intoger, the last 
term a, in A(x) is to be multiplied by 4. In particular 


etio et 
(2.5.3) f(x) = I ns f(s) ds, 
where ¢ > 1 and J(s) =- wa 3 


This forinula contains the “analytic set-up” of the prime nunber problem. 
In what follows I shall give two approximations to the solution, the first so 
rough that, in the present state of knowledge, it cannot be earried through. 

It is known that ¢(s) is an analytic function of s, which has a simple pole 
of the type 1 


eat 


at s = 1, but is otherwise regular. It satisfies Riemann’s equation 
¢(1—s) = 2(27)-* cos ism I(s) &(s). 
It has simple zeros (the “trivial” zeros) at 
$= —2, —4, ~6,... 
and an infinity of complex zeros p for all of which 0<o<1. The “Riemann 


Hypothesis”’ is the hypothesis that all the p have the real part }. 
The function (2.5.2) has also a pole of type 


2 te 
soit 
at s = 1, and poles of types 
i + : as 
s— In” s-p 7" 
* The integral is a principal value at infinity (i.e. the limit of an integral from 
c—if to c+7T) when #= 1. 
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at the zeros of ¢(s). Let us assume that the Riemann Hypothesis is 
true, and that the behaviour of f(s), when 7 > 3 and | t|>«, is ‘not too 
bad’. Then it is natural to suppose that we can move the contour of in- 
tegration in (2.5.3) across the pole at s = 1, and de duce 


1 fytio d 
(2.5.4) y (x) = tel, fs)> 8, 

where }<y<1. The a is the residue arising from the pole at s = 1. Further, 
since every element of the integrand is of order «7 in z, it is natural to suppose 
that the integral is of much the same order, and that 


(2.5.5) r(x) = &+ Ola?) 


for every f > $. Since the difference between y*(x) and y(x) is at most log a, 
this gives the Prime Number Theorem (and a good deal more). The argu- 
ment cannot be developed accurately on quite the simple lines which I have 
sketched, but (2.5.5) is actually a correct conclusion from the Riemann 


Hypothesis. 

It is easy to deduce from (2.5.5) that 

(2.5.6) n(x) = liat+ O(a), 
again for any & > 4. For (2.5.5) is equivalent to 

(2.5.7) B(x) = 2+ O(a). 

= dt)" 

Also ma) = 1+ { ea 
and so mx) —lit= pT oes +0(1) 


~ Ja logit Ja loge 
= [se —} a O(1) 


2 logt 
Ha)-v log2~2 (7 H(t)—-t 
~ loga log 2 i t(log pee) 
= oh z pANL 
= Olea) + OU] ogapl + 
= O@"), 
which is (2.5.6), 
The function 
(2.5.8) T(z) = (2) + hor(at) + dar(at) + me 


i Ibis convenient to use the notation of the Stieltjes integral: O(a) is the step-function 
with jumps log p at the points a = p. 
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ig related to #(x) much as is 7(x) to A(x). Since 
ga(at) +4a(zt) +... Sat+at+... = O(atlog x), 
we have also 
(2.5.9) I(x) = lin+ O(2*). 


Second approximation to the proof 


2.6. This is the “first approximation”’ to the proof of the Prime Number 
Theorem, but it is essential that we should get a little closer to the truth if 
we are to understand Ramanujan’s attempt. In my second approximation 
I shall discard the unproved Riemann Hypothesis; but that will not be the 
most important improvement. 

The main difficulty in my first ‘‘proof” lay, not in the assumption of the 
Riemann Hypothesis, but in the naive argument about the “order’’ of the 
integral in (2.5.4). The fundamental difficulty is that the integral is not 
absolutely convergent, and that we cannot be sure that its order is not 
much greater than that of any of its elements. We avoid this difficulty by 
considering not #*(x) itself but an average of y*(x) or s(x), and finding 
an asymptotic formula for this average. We have then to infer back from 
the average to the function itself, and this introduces a new element, a 
“Tauberian”’ element, into the proof. 

It is most convenient to state the argument in terms of general Dirichlet’s 
series, and I shall prove the following general theorem. Suppose 


(i) that 
(2.6.1) J (s) = 2a,0-° 
is absolutely convergent for 7 > 1; 


(ii) that f(s) is regular on the line o = 1, except for a simple pole, with 
residue 1, at s = 1; 


(iii) that 
(2.6.2) flo +it) = O((t|9), 
where @ <1, for 721 and large |t]; 
(iv) that 
(2.6.3) On > ~ K, 
where K isa constant. Then 
(2.6.4) A(a)~ 2, 
We write 
(2.6.5) Aya) = ["A@ndy = ["arwnay! 


* A(y) and A*(y) differ only at isolated points. 
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From wa j etio as 
AX(2) = 5 | feeds (>) 
it follows by integration that 
1 e+ico gsth d 
(2.6.6) Ate) = 55 [faery ® 
1 c+tie a 
We have also oer ver &(s) ds = z+ 0(1)," 
2.6.7 Ue a) de eA 401). 
i ale s(s +1) 


Subtracting (2.6.7) from (2.6.6), we obtain 


c+ico +4 
(2.6.8) — 40? + Ole = sal. Ks) 5 a ay 
where 
(2.6.9) g(s) = f(s) —- 


and a simple continuity argument enables us to take ¢ = 1. Thus 
1 I+teo ast 
y= sal 0 WO aaa) 


2 


(2.6.10) fe +O(e 


Now it is easy to prove that 
= O(| é|*),3 
where 0<a<l,ie. that €(s), and therefore g(s), satisfies condition (iti) of 
the theorem, Hence the integral in (2.6.10) is of the form 


t . 
2 [7 ofl onan, 


the product of z? by an integral of the type 


{ * H(A eitlows de — i ” A(t) ede, 


where {i H(é)|dt<oo, Such an integral tends to 0 when £ = loga-> 00,4 
and therefore Aj{x) — 422 = 0(2°), 


or 
(2.6.11) Ay(x) ~ $2, 


' Exactly, [v]ifa is not an integer, x} if it is. 

* We must subtract (2.6.7) from (2.6.6) before putting c = 1, since f(s) and ¢(s) cach 
have a pole at s= 1. 

3 In fact (s) = O(log |t|): see Ingham, 27, or Landau, Handbuch, 169. 

+ By the “Riemann-Lebesgue theorem” for trigonometrical integrals : see, for example, 
Titchmarsh, Fourier integrals, 11 (Theorer 1). 
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2.7. We have now to pass from (2.6.11) to (2.6.4). This requires a 
“Tauberian” argument, here of a very simple kind. If 


(2.7.1) nn = Oy, 
then Bz) = ¥ (@,—-1) = A(x) -—a+ 0(1), 


B,(e) = | "Be)dy = A,(2) —3 + 0(@, 


and so 
(2.7.2) B,(z) = o(x*); 
and 
(2.7.3) 6, > —K-1l=~L, 
say, by (2.6.3) and (2.7.1), We have to prove that 
(2.7.4) Biz) = o(z). 


Suppose that (2.7.4) is false. Then there is a positive d such that one or 
other of 
(2.7.5) B(w)> dz, B(x)< —de 


is true for arbitrarily large values of «. 
Take, for example, the first hypothesis, and suppose that B(£) > 6&. Then 


Bix) — Bie) Sprints? ~L@—£) 


and B(x) > 0€~ L(x—&) > 40k 
for Ece<e'= (1457) € 
Hees B(6+37)- Bé)> [rissa Ze 


But this contradicts (2.7.2), since each term on the left is o(£%), 

We can deduce a contradiction similarly’ from the second hypothesis 
(2.7.5), and the two contradictions prove (2.7.4). 

Tn order to deduce the Prime Number Theorem we must show that the 
function (2.5.2) satisfies the conditions (ii), (iii) and (iv) of §2.6. The last 
condition is satisfied because A(n) 20. Condition (ii) is equivalent to 


(2.7.6) £(1+ it) £0, 
and (iii) asserts rather more. It is the verification of these conditions that 


is the main difficulty of the proof. 


* But now using an interval (£’, £) to the left of £. 
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Later developments 


2.8. There are two chapters in this proof (and in all other proofs) of the 
Prime Number Theorem. The first part of the proof is properly function- 
theoretic. We show that {(s) has certain properties, and deduce an asymp- 


totic formula for 
(2%) 1 i 4 
tale) =; vou 


or for some other average of y(x). The second part is ‘““Tauberian”. The diffi- 
culties are divided differently, in different proofs, between the two chapters. 
In the ‘‘classical” proof, which I have sketched to you, the first chapter 
is difficult and the second easy, only a very simple Tauberian theorem 
being required. Later proofs simplify the first chapter at the expense of 
the second. 

The most important of these more recent developments is due, in essen- 
tials, to Wicner. Wiener and his followers have shown that we may simply 
strike out the condition (iii) in the general theorem of §2.6. We still 
need (ii); in particular, in the application to primes, we must still prove 
(2.7.6); but we do not require to know anything at all about the behaviour 
of ¢(s) at infinity. It had for long been said, a little vaguely, that “‘the 
Prime Number Theorem is equivalent to the assertion that {(s) has no 
zeros on o = 1”’, but Wiener has enabled us now to interpret this statement 
literally, and this is naturally a very important contribution to the logic of 
prime number theory. 

What is relevant now, however, is not Wiener’s generalisation, the “true” 
generalisation of the theorem, but a half-way generalisation made by Little- 
wood and myself in 1915. This half-way theorem has lost its significance 
since its supersession by Wiener’s, but it is just what is wanted for the 
comprehension of Ramanujan’s work. 


Littlewood and I found ourselves in possession of a powerful Tauberian 
theorem: if 


(2.8.1) Za,e-1 1 
when y->0, and a 
(2.8.2) dy, 0, 

then A(x) ~ a. 


This has an immediate application to the Prime Number Theorem, since, if 
we can prove that 


(2.8.3) TA(n)e-w vA, 
2 y 
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it will follow that w(x)~x. Since 
(L—e-¥) Sa, oY = (Le 4)? Bldg + Oy + 0 $y) Oo 


A(0)+A(1)e-¥ +... + Alm) eM +... 
TUT 2eU A (me Dew +, 


and 1—e-¥~y, we can regard 
Y 2, e-MU 

as a kind of average of A(n)/n. Hence a function-theoretic proof of (2.8.3) 
will lead to a proof of the Prime Number Theorem conforming to my 
general description. 

Littlewood and I, then, proceeded as follows. In the first place, we proved 
an extension of the general theorem; we showed that condition (iii) of § 2.6 
may be replaced by 


(2.8.4) fle +it) = O44), 


for any positive constant A. This of course has no importance now (since 
in fact no condition of this kind is wanted). Secondly, we deduced (2.8.3); 
and finally we applied our Tauberian theorem. It is on (2.8.3) that I want 
you to fix your attention, since this was also Ramanujan’s first objective. 


Ramanujan’s argument 


2.9. I can now pass to Ramanujan’s actual “proof”’, which he showed 
me some time after his arrival in England. I shall be able to make the points 
of it clearer if T allow myself to misinterpret his object a little. 

Ramanujan was trying to prove, not merely the Prime Number Theorem, 
not even merely a result ke (2.5.6), whichis true on the Riemann Hypothesis, 
but a much more precise result which we know to be false. He committed 
a definite, and a very curious, fallacy: his argument is not merely “un- 
rigorous” but in a more drastic sense “unsound”, and I want to make 
his mistake quite plain. I can do this more easily, and without doing him 
the slightest injustice, by talking as if his aims had been limited to the proof 
of the Prime Number Theorem. 

Ramanujan writes 


(2.9.1) Ply) = Vlog p ¥ e-P"v—log 2 F aMe-29 
D m=. 


m= 
= $1(y) ait Pal¥), 


and sets out to prove, first, that 


I 


2.9.2 wm 
( ) Aly) y 
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He does not use the notation A(n), but actually 


(2.9.3) dry) = BAnye™, 


so that the Prime Number Theorem would really follow from (2.9.2). 
Next, he writes 


(2.9.4) Bly) = Ply) -d(2y) + B(3y) — ... = Py(y)— Pp(y), 

where ©, and ®, are the functions related to ¢, and ¢, as © is to ¢. Then 
phy ery 

(2.9.5) G,(y) = Zilog p z wine = EA(n) =D 

and 
foe) Qme—By Ry 

(2.9.6) @,(y) = log 22 Te = 2log2 Toe? 

by the elementary identity + + d a z : 
evil e¥+1 e8¥+1 eu —1 


2.10. Ramanujan now makes a transformation of ®,(y). We have 


(2.10.1) ®,(y) = Ply) — 2%4(2y), 
where 

(2.10.2) Py) = ZA(n) 5 —, 
But 

(2.10.3) Bly) = ¥ A(n) Sy e-™ = Soe, 

n= m1 2 

where 

(2.10.4) CG, = SAln). 

nie 


Also, if k = Hp*, 
DA(n) = Salogp = Hlogp? = logk.t 
pik Blk 


nik 
Hence Wiy)= log ke-™ 
and 
(2.10.8) Oy(y) = % log be-*u — 2 % log ke-*y 
= Slog e-W— 2 Slog 2k e-#” +. 2log 2 Fen 2e 
2 2 
= eV log 1—e-*” log 2 +e log 8—... + 2log 2 ee 


* Since we must count log p for each of the divisors p, p*, ..., p%- 
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Combining (2.9.6) and (2.10.5), we obtain 


(2.10.6) Dy) = eV log 1—e-¥ log 2+ e-*” log 3 —.... 
Ramanujan now infers that 
(2.10.7) Dy) >1,! 
or 
(2.10.8) $y) — G(2y) + 6(8y)— +1, 


for some 1. He gives no reason, but the conclusion is correct and easily 
proved.? Up to this point his argument, though expressed in a less con- 
venient notation than that which I have used, is quite sound. 

Next, Ramanujan infers from (2.10.8) that 


(2.10.9) dy) >l. 


All that would be necessary, if he were aiming at the Prime Number Theorem 
only, would be the milder conclusion that 


(2.10.10) Py) = o(-), 


and we may continue his argument as if he asserted no more than this. 
He then states that 
1 


(2.10.11) oy) = log 9 z Qin g-2"y oa 
i 
and from (2.10.10) and (2.10.11) he deduces that 


(2.10.12) $3(y) = bly) + alw)~> 


which is (2.8.3). What he actually says and professes to derive from (2.10.9) 
is that 


(2.10.13) bly) = it 0(1), 
or at any rate 
(2.10.14) AY) = it Oy) 


for every positive 0. 

Now (2.8.3) is true; it is, as I said, the half-way stage in the “Hardy- 
Littlewood” proof; and from (2.8.3) we can deduce the Prime Number 
Theorem in an “‘elementary” manner, that is to say by arguments which 
make no use of the notion of an analytic function of the complex variable. 
* Tuse / for “‘a limit”? (not necessarily the same in different contexts). 


* For example, the serics ‘ \ 
og 1—log2+log3—~... 


is saummable (C,1). The sum is — } log }7. 
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It follows that, if Ramanujan really had proved (2.10.12), he would have 
found an elementary proof of the Prime Number Theorem, a proof involving 
no function-theory at all. In particular, he would never have needed (2.7.6); 
and this is of course enough to convince any reader who knows tho subject 
that the proof cannot possibly be correct. And in faet Ramanujan has 
deduced the true conclusion from two false propositions, the proposition 
(2.10.11), and the proposition that (2.10.8) implies (2.10.10). 


2.11. Ihad better show the falsity of these propositions at once. In the 
first place, (2.10.8) does not imply (2.10.10), and still less (2.10.9). Suppose, 
for example, that 


xy) sy, 
Then y(y)—xX(2y) + x(By)—... = yr 1 — 271-8 + Bat |) 
= (1= 2-4) +08), 
which is 0 if a= etm, 
log 2 


but yx(y) oscillates, in contradiction to Ramanujan’s statement. It is true 
that y(y) is not a power-series in e~¥, as is Ramanujan’s ¢,(y), but we can 
find such series which mimic the behaviour of v(y) as closely as we please, 
and the statement cannot be rehabilitated by any such reservation. 

It is only natural that Ramanujan’s argument should contain flaws like 
this, where his instincts misled him about the validity of difficult general 
theorems. There are true Tauberian theorems which have some superficial 
resemblance to the one which I have just refuted, and a good deal of ex- 
perience and subtlety is needed to distinguish the true from the false. His 
second error is much more surprising, since one would have expected him 
to be right about the behaviour of a special function like ¢,(y). 

He seems to have been deceived by an “integral analogy”. The integral 
analogue of the series (2.10.11) is 


(2.11.1) log 2 | “ores da, 
0 
= Ly Ce eu 1 
and te" da, = ——_ HUE 1g ee 
{, & i log 2 a de ylog2 ylog2’ 


so that (2.11.1) behaves in the manner which he attributes to (2.10.11). 
But (2.10.11) itself behaves differently, having “wobbles” of order 1/y. 
We can refute Ramanujan’s assertion in numerous ways. In the first 


place, if (2.10.11) were true it would follow (by the Hardy-Littlewood 
Tauberian theorem) that 


a 
ann : 
oe log 2 
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This is plainly false, since the series is practically doubled when x passes 
through a value 2”. 

A more direct argument is as follows. The function ¢,(y) satisfies the 
eapenion daly) — 2ba(2y) = 2e-* log 2. 


It may also be verified at once that 


a co) (- ly ort 
aly) = — log? 


satisfies ure(y) — 2yry(2y) = 2e-*Y log 2, 
and therefore hy) = daly) — aly) 
satisfies h(y)— 2h(2y) = 0. 
Also yh(y) is not a constant." 

If now we write yh(y) = A (logy), 
then H(logy) = A(log y +log 2), 


so that H is periodic and not constant. Hence yh(y) does not tend to a limit, 


nor does y¥$o(y). 
Finally we can, if we please, exhibit the “wobbles” in a formula. We 


can prove that 


atiippg se feh yy 2 LS, pf Le Bhs) | seninogs 
(2.11.2) $2) = 7~loe 2S ng, Toes VY : 
where the dash excludes the value & = 0; and the last series shows the 
wobbles, of order 1/y, explicitly. It converges rapidly, and the wobbles are 
smali compared with the dominant term. 


The genesis of Riemann’s series 


2.12. It will be plain by now that Ramanujan’s proof of the Prime 
Number Theorem was quite wrong. His errors were fundamental; he was 
wrong not merely because he could not supply the necessary “rigour”’, 
but because the path which he followed did not follow the facts. I should 
like to say that ‘‘rigour apart, he found the Hardy-Littlewood proof”, but 
I cannot. 

Ramanujan could not prove the true Prime Number Theorem, and 
naturally he could not prove the false (2.3.1) or (2.3.2). I shall say something 
later about the way in which he professes to derive them from (2.10.13), 
but I must first make a few remarks about the status of the series R(x) in 
the orthodox theory. 


+ w,{y) is a meromorphic function, while ¢,(y) has a barrier along the imaginary axis. 
The point could also be settled by calculation, 
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There are two goals in the theory. One is to prove the Prime Number 
Theorem, or some refinement upon it. The other is to find an exact analytical 
expression for 7(«), or one of its associated functions; an expression which 
may incidentally provide an approximation for w(x), but is sought ag an 
end in itself. Riemann (who never even mentions the Prime Number 
Theorem) attacked the second problem. 

It is easy to see how such identities may be derived from the integral 
(2.5.8). It is natural to suppose that the integral is equal to the sum of the 
residues at all the poles to the left of the line of integration. These are easily 
calculated, and we find the formula 

ie a £0) I 

(2.12.1) We) = a-E SF aos (1-25). 

Here the first term arises, as in (2.5.4), from s = 1, and the last from the 
“trivial” zeros. It is fairly easy to deduce a formula for the function J7(x) 
defined by (2.5.8). If H*(x) is related to [7(x) as w*(x) is to w(x), then 


(2.12.2)  I*(x) = Be- Zhe |” ah log 2. 
p 


2 (we Tulogu 
The definition of liz must be extended appropriately to cover complex z. 

We pass from [Z(«) to m(z) by one of the “inversion formulae” associated 
with the Mébius function. If 


_ > 4(é 
ae) = 3 (2), 
then (subject to certain reservations about convergence) 
J) == min)a(2). 
n=1 n 
It follows from this and (2.5.8)! that 
(2.12.8) a(t) = yee) TT (un), 
I 
and, if we ignore the difference between JZ and [I*, and replace [(z¥) in 
every term of (2.12.3) by liz!”, we obtain R(x). This involves neglecting 


every term in (2.12.2) except the leading tcrm, that is to say, substantially, 
ignoring the complex zeros of ¢(s). 


The exact proof of (2.12.1) and (2.12.2) is given in thetextbooks. Riemann 
(whose proof is not exact) argued rather differently. He observed first that 


(2.12.4) I Ha) 2-*-*des = © i | dM 2) = > a, = OBE) 


* Take (6) = Em(e8), g(E) = £7T(ef). 
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for ¢>1. From this it follows, by ‘“‘Mellin’s inversion formula”, that 


(2.12.5) I(x) = = ae B80 sae 
ife> 1. 

If 

(2.12.6) s=44+%, 
then 

(2.12.7) E(z) = 49(s— 1) 7-*8 I(4s) €(s) 


is an integral function of z. Its zeros are given by 

(2.12.8) s=p, 2=—tUp—})=T. 
They lie symmetrically about the origin, and are real if the Riemann 
Hypothesis is true; and 


(2.12.9) &(z) = g0)11(1~3). 


From (2.12.7) and (2.12.9) we obtain 
(2.12.10) log&(s) = —log (s—1)—log I'(3s + 1) + $s loga 


+ log &(0) + Zoe 1 +82, 


and Riemann substitutes from (2.12.10) into (2.12.5) and evaluates the 
terms of the resulting series separately. The dominating term of (2.12.2) 
results from the term —log (s— 1). 

There are gaps in the argument, of which the most important is the lack 
of any sufficient proof of (2.12.9). This was only made possible much later 
by Hadamard’s work on integral functions. But what I want you to notice 
particularly is that there is nowhere any explicit use of Cauchy’s Theorem. 
Riemann’s formal machinery, his term by term integrations of series, 
his use of Fourier’s Theorem in the proof of (2.12.5), and his evaluations of 
particular definite integrals, would all have been intelligible, and highly 
sympathetic, to Ramanujan. 


2.13. I return to Ramanujan’s argument. We can write (2.8.3) or 
(2.9.2) as 


* itd end 
(2.18.1) [ee dyf(z) - 


* Riemann put s = c+ it, and expressed (2.12.4) as a Fourier integral with a factor 
e-ttlog z) 


then appealing to the Fourier formulae, The two arguments are formally equivalent. 
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(though Ramanujan does not use this notation), and from these propositions 
we can deduce the Prime Number Theorem. Ramanujan thought that he 
could prove much more, viz. (2.10.18) or (2.10.14), which may be written as 


° ue oe 

(2.13.2) iF cr diple) = + O(1), 
or 

(2.13.8) i “ew aye) “ ; +0(y~). 
From these he deduced 

(2.13.4) W(a)—a = O(1) (or O(2*)) 
and the passage to 

(2.18.5) ma) ~ R(x) = O(1) (or O(2°)) 


was easy. But his argument here is unsound beyond any possibility of 
restoration. Not only are all of (2.13.2)-(2.13.5) demonstrably false, but 
the passage from (2.13.2) or (2.13.3) to (2.13.4) is also fallacious. There are 
no Tauberian theorems which permit us to make inferences like this. And 
I do not think that it is worth while to probe more closely into the details 
of Ramanujan’s reasoning. There is one reproach at any rate that cannot 
be made against it. Unsound as itis, itis not “dim”; Ramanujan was never 
dim. It contains a very interesting idea, and one which, when properly 
pruned, fits into its place in the theory. 


The question of Ramanujan’s originality 

214. Whatever you may think of Ramanujan’s argument, you will 
agree that his formal ideas were fine, and you are sure to wonder whether 
they were all his own. In particular, did he really discover the Riemann 
series himself? 

My own opinion is that he did. There is, however, just one book in which 
he might conceivably have seen the series. There was a copy of Mathews’s 
Theory of numbers in the Madras library, and this book contains a (rather 
uncritical) reproduction of Riemann’s analysis. It seems worth while to 
consider for a moment what books, of importance to him and accessible in 
Madras, Ramanujan may have consulted. 

There were five books which would have been particularly important to 
Ramanujan: Whittaker’s Modern analysis (published in 1902), Brom- 
wich’s Infinite series (1908), Mathews (1892), and Cayley’s and Greenhill’s 
treatises on elliptic functions. My own opinion is that he had seen one at 
least, and possibly both, of the last two books, but none of the other three. 
Ihave no idea how this may have happened, and my opinion is based on 
internal evidence only, but no other hypothesis seems to fit. 
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In the first place, Ramanujan cannot have scen Whittaker, since he did 
not know Cauchy’s Theorem. For the same reason, of course, he cannot 
have seen Forsyth’s Theory of functions. This is important because it 
shows that there were ‘‘obvious” books which Ramanujan had never seen 
although they were certainly accessible in Madras. 

The evidence about Bromwich is not so conclusive, but I cannot believe 
that Ramanujan had seen the book. He was keenly interested in divergent 
series, about which he had a “theory” of hisown. Bromwich has a long and 
very interesting chapter on the subject, which would have fascinated 
Ramanujan; but Ramanujan never showed any knowledge of Ces&ro or 
Borel summability, or of any of the standard work. It seems plain indeed 
from passages in his letters that he had no idea that any scientific theory 
of divergent series existed. 

I do not think, then, that Ramanujan had seen either Whittaker or 
Bromwich; butitseems plain that hehad read some book onelliptic functions, 
and I agree with Littlewood in thinking that it was probably Grecnhill’s. 
He never refers to books, but he never refers to any of the standard theorems 
of the subject as though he thought them his own. He claims to have ex- 
tended the theory in different directions, as he had done, but not to have 
invented elliptic integrals, theta functions, or modular equations. All these 
things he treats as parts of common knowledge. His own knowledge was 
remarkable both for its extent and for its limitations, and both the extent 
and the limitations fit excellently with the hypothesis that it was based on 
Greenhill’s stimulating but eccentric book." 

These theorems about prime numbers, on the other hand, Ramanujan 
claimed quite definitely as his own (though of course he recognised his 
mistakes later, and learnt the outlines of the established theory). This, to 
anyone who knew Ramanujan well, is eonclusive; but the hypothesis of 
Ramanujan’s complete independcnee is also the only one which seems to 
me to fit the facts. 

In the first place, if Ramanujan had ever seen Mathews, how could he 
have been as ignorant as he was of the classical theory of quadratic forms, 
which Mathews discusses elaborately and which fills nearly half his book? 
The series for the class-number, in particular, would have fascinated 
Ramanujan, and it is certain that he would have studied them intensively. 

But the most conclusive evidence is Mathews’s chapter on primes itself. 
This, whatever its defects, contains a fairly adequate description of Rie- 
mann’s memoir. It lays all the proper emphasis on Riemann’s great dis- 
covery, that the theory of primes depends upon the “complex” properties 
of &(s), and in particular on the location of its zeros. The complex zeros of 

* In which we learn first on p. 258 that the elliptic functions are doubly periodic. 
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£(s) dominate the analysis, as they must in any account of Riemann’s work. 
Ramanujan had no accurate ideas about analytic functions, but he knew 
quite well that an equation could have an infinity of complex roots, and he 
could have followed the argument without difficulty. It is incredible that, 
after seeing this chapter, he should have proceeded to construct a theory 
in which ‘‘all the zeros of ¢(s) were real’’. 

My conclusion is, therefore, that all this work of Ramanujan, with iis 
flashes of inspiration and its crude mistakes, was an individual and un- 
assisted achievement. There is no other hypothesis which seenis to me to be 
tenable, or to male any sort of mathematical or psychological sense. 

In conclusion I may say this. No one before Riemann, so far as I know, 
had written down R(x), and if Ramanujan found the series himself it may 
seem a very astonishing performance. Even Causs stopped at liz. 

There is, however, a certain danger of exaggeration. If we see that 


IT (@) = m(x) + dn(zt) + 4n(xt) +... 


plays a more “‘natural’’ part in the theory than 7(x) itself, as 7(z) plays a 
more natural part than 9(«); if we realise that it is Z(x) which “ corresponds 
naturally” to liz; and if we are familiar with the inversion formula of 
Mébius; then, when we pass back from (a) to m(”) by this formula, the 
series R(x) presents itself inevitably. All these ideas were familiar to 
Tchebychef, and there was no reason at all why he should not have written 
down R(x), though he never seems to have donc so. Ramanujan knew all 
these things too, as the argument which I have presented to you shows quite 
clearly, and everything that he did, however we may judge it, is thoroughly 
intelligible. 


NOTES ON LECTURE I 


This lecture is a revised and enlarged edition of one given to the London Mathe- 
matical Society on February 18, 1937. 

§ 2.1. The letters are those printed in full in the Papers, xxiii-xxix and 349-352. 
About the time when the letters were written, Narayana Aiyar (1) published 
Ramanujan’s principal assertions in the Journal Indian Math. Soe. 

T have altered Ramanujan’s notation for the sake of convenience of reforence: he 
has e* for x and a for y in (2.1.1), » for x in (2.1.3) and (2.1.4), and y for ¢ in (2.1.4). 
He also writes S,,, for €(¢+1). 

Ramanujan implies that p(z)->0o0 when z~-+00, which is false, as Littlewood proved 
in 1914, Littlewood’s proof, which is difficult, will be found in Ingham, ch. 5 or 
Landau, Vorlesungen, ii, Kap. 11. Tho similar assertions about primes in arithmetical 
progressions, at the end of the passage quoted from the second letter, are also false. 

§ 2.2. For the series (2.2.1) see Gram, Skrifter d. K. Danske Videnshabernes 
Selskab (6), 2 (1884), 185-308 (212, 295). 

For the history of (2.2.2) and (2.2.3) see Landau, Handbuch, 567-574. That the 
Prime Number Theorem could be deduced from (2.2.2) by ‘olomentary’ reasoning 
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(that is to say by reasoning independent of the theory of functions of a complex 
variable) was first proved by Landau, Wiener Sitzungsberichte, 120 (1911), 973-988; 
the Handbuch contains only a deduction from (2.2.3). 

According to Soldner, li x = 0 for # = 1-4513692346..., and it seems probable that 
this is the real value of Ramanujan’s c. 

For (2.2.5) see Bromwich, Infinite series, ed. 2, 334, 

To prove (2.2.7) we observe that 


(H1)}%y* wm) (= 1)" fy \? 
(1) ae TS nina i ee m Sal (*) 


ulm J — e~¥ 
=1-28) [ mee 
™m 0 Uu 


it ~en~ 
-2em( {7 ac “d+ log) 
mA\Jg  u 


_ 5 mlm) 
= z “mn xy, m), 


say, by (2.2.3). Now 


(ym) ~ xlysm+ 1) es du ~tog 
2) ~ X(y, mm =o Ua 
My MY: ultmtl) m+1 
yim ti 
= as = w(y,™), 
ul(mt1) 4% 
m ne 
say. If we write ze = 9(m), 


and transform the last series in (1) by partial summation, we obtain 


A(—-y) = Zg(m) wly,m). 
Now it is known that 


1 
2 = Of...) ; 
(2) g(m) (rosa 
y m+1 1 
< ~ulmtl) 
and OS Oa) frase) e i 
Hence the series is majorised by a multiple of 
1 
mlogm)?? 


and is uniformly convergent for ally. Finally, all of its terms tend to 0. 

For (2), and much stronger results, see Landau, Handbuch, 594-597. 

For the proof of (2.2.8) see Hardy (3). 

§2.3. For (2.3.8), which is truo on the Riemann Hypothesis, see Ingham, 83 
(Theorem 30) or Landau, Handbuch, 378-388. For tho falsity of (2.3.4), which is 
much easier to prove than that of (2.3.5) or the other false assertions of § 2.3, see 
Ingham, 90 (Theorem 32) or Landau, Handbuch, 711-719. 

To deduce (2.3.4) from (2.3.2), observe that 


fea) ao 2 
Re)—lie+ glial = of Sa Oe) } 
m=3 Mp=1p.pim? 


© 1 2 1 flog a\r 
=Oftoge 3 3 a )"} = o¢et 2)s 
{ e maa paop!\ 3 ( B®) 
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The table is extracted from the much more comprohensivo one in D, N. Lohmer’s 
List of prime numbers from 1 to 10,006,721 (Washington, 1914), oxcept for the values 
of li 108 and li 10°, which I have taken from Ingham. The values of 7(x) are loss byl 
than Lehmer’s, sinco ho counts 1 as a prime. Those beyond the range of the factor 
tables were found by Meissol in a series of papers in the Math. Annalen of which the 
latest appeared in vol. 25 (1885), 251-257, and afterwards confirmed by Bortelsen. 
Meissel’s method is an elaboration of the ‘sieve’ of Eratosthenos: seo Mathews, 
ch. 10. There is an account of Bertelsen’s work in a papor by Gram, Acta Math. 17 
(1893), 301-314. 

There is a long and interesting ossay on those topics in the introduction to 
J. Glaisher’s Factor table for the sixth million (London 1883). Torelli’s monograph 
Sulla totalita dei numeri primi fino ad un limite assegnato (Naplos 1901) also containg 
much interesting information. 


§ 2.4. Thero are proofs of Tchebychef’s theoroms in Inghain, in both of Landau’s 
books, and in Hardy and Wright, ch. 22. 


§§ 2,5-2.7. Thero is no proof of tho Prime Numbor Thoorem anywhere in this 
book, though §§ 2.6-2.7 contain a sketch of Landau’s proof, which is simpler than the 
original proofs of Hadamard and do la Vallée Poussin. To complete it, we must 
prove that 
(1+ at)| : 
() Hipay |= Wel) 
for an @ less than 1. This naturally presupposes (2.7.6), which is provod, in two 
different ways, in Lecture IV (A). The stronger proposition (1) can be proved by a 
development of the first proof (Hadamard’s) of (2.7.6). 

Both Ingham’s tract and Landau’s Handbuch contain (a) tho sinrplest proofs of 
the Prime Number Theorem (apart from Wioner’s proof roforrod to in § 2.8) and 
(b) more elaborate proofs of much stronger theorems. 

A ‘Tauberian’ theorem may be defined as the corrocted form of the falso con- 
verse of an ‘Abelian’ theorem. An ‘Abelian’ thoorem asserts that, if a sequonco or 
function behaves regularly, then somo average of it behaves rogularly. Thus 


Alaj~a 
implies A,(z) = ["aw) dt~ hu?: 


this Abelian theorem is true for any A(z) and in particular for the A(x) of the text. 
The converse is falso, but bocomes true when we subjoct A(z) to an appropriate 
additional condition, here implied by (2.6.8). 

The first Tauberian theorem was Tauber’s converso of Abel’s theorem on the con- 
tinuity of power series: see, for example, Bromwich, Infinite series, od. 2, 256. Since 


aot aye+aqat-h.,, = 0 ee) at (Gor ca +45) wt. 


> 


the limit of the series is the limit of a certain average Of ap, Gy +@y, Ay +A, +9). 


§ 2.8. What is here called Wiener’s proof of tho Prime Number Theorem is the 
“Wiener-Ikehara’ proof, containod in §§ 19 et seg. of Wiener’s The Fourier integral 
(Cambridge 1933). There is a quite different proof in §§ 17-18, 

The proof has been much simplified by Bochner, Math. Zeitschrft, 37 (1933), 1-9 
and Landau, Berliner Sitzungsberichte (1932), 514-521. Landau’s version embodies 
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the shortest extant proof of the Prime Number Theorem, but has not yet appeared 
in any book. The version in my lithographed lectures on ‘Ramanujan’s work’ 
(Institute for advanced study, 1936) is substantially Bochner’s. 

For the ‘Hardy-Littlewood’ proof see Quarterly Journal of Math. 46 (1915), 
215-219 or Acta Math. 41 (1918), 119-196 (127-134). The simplest proof of our 
Tauberian theorem is that given by Karamata in Math. Zeitschrift, 32 (1930), 319- 
320. 

§2.10. For the summability of log 2—log 3+... see Bromwich, Infinite series, 
ed. 1, 351. 

§ 2.11. The argument towards the end of the section was suggested to me many 
years ago by Prof. Maclagan Wedderburn. See Hardy, Quarterly Journal of Math. 
88 (1907), 269-288 (277). 

The formula (2.11.2) may be deduced by differentiation from the last formula on 
p. 288 of this paper (in which the sign of the last term should be changed). 

Ramanujan, when I disputed the truth of his statement, produced the amended 
formula 


; y ¥ ¥ ee 
daly) og 2(2 rhCrT fate }apr re 
where y Fly) = -0000098844 cos (Fock! e728) 


correct to 10 places of decimals’. This takes account explicitly of the terms in which 
k=tl. 

§ 2.12, The proofs of the ‘explicit formulae’ are given in Ingham, ch. 4 and 
Landau, Handbuch, Kap. 19. Riemann’s own argument is reproduced in Mathews, 
ch, 10. 

For the Mobius inversion formulae see Hardy and Wright, 234-237, or Landau, 
Handbuch, 577-580. 

§ 2.14. Ramanujan seems to have started his own theory of divergent series with 
series of positive terms, such as 


1742-43-94... (8<1) 


and to have ‘defined’ the sum of such a series as the constant of the Euler-Maclaurin 
sum formula. But he was not in the habit of giving strict ‘definitions’. 


EE 


ROUND NUMBERS 


3.1. A number is described in popular language as round if it is the 
product of a considerable number of comparatively small factors. Thus 
1200 = 24.3.5% would certainly be called round. The number 2187 = 3? is 
even rounder, but this is obscured by the decimal notation. 

It is a matter of eommon observation that round numbers are very rare; 
the fact may be verified by anyone who will make a habit of factorising 
numbers, such as numbers of motor-cars or railway carriages, which are 
presented to his attention in a random manner. Both Ramanujan and I 
had observed this phenomenon, which seems at first a little paradoxical, 
and were curious about its mathematical explanation. 

We therefore proposed to ourselves the problem of determining the 
“normal degree of compositeness”’ of a number n. How many prime factors 
may one expect to occur in a random large number n? 


3.2. It is natural to measure the “compositeness’’ of a number by the 
number of its prime factors; and we may count this in two different ways, 
according as we count multiple factors multiply or not. Suppose that 


(3.2.1) n= phple...pg = [ptr 


where Py <Po<.--<Dys 

is the standard expression of n as a product of primes. Then 
(3.2.2) f(n=r 

is the number of different prime factors of n, and 


(3.2.3) Fin) = 


Ms 
Q 
a. 


is the total number of prime factors when multiple factors are counted 
multiply; and either of these functions may be adopted as a measure of the 
compositeness of n. We shall find that itmakes no important difference which 
measure we adopt, and for the moment I consider f(n). 


* “Half the numbers are divisible by 2, one-third by 3, one-sixth by both 2 and 3, and 


so on. Surely then we may expect most numbers to have a large number of factors? 
But the facts seem to show the opposite.” 
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3.3. It is clear that f(n) cannot be ‘‘very large’. The worst, or roundest, 
numbers, those for which f(a) is largest compared with 7, are the numbers 


n= 2.3.5... 
which are products of the first y primes. For these numbers 
fm) =v =7(p,), 


and logn = x log p, = 0(p,). 
rSy 
Now 
(3.3.1) Avlogv<yp,< Brlogv, 
for constant A and B, and so 
log p, ~log v. 
Hence Bs 
Spe FIND) TORN 08.8 
(3.3.2) y=f(n) = Oe aa) lore 
and v~ 10g ‘ 
log log n 


Thus a large number » cannot have more than about this number of prime 
factors. A number about 10’, the limit of the tables, cannot have more than 
about 6 or 7,and a number about 10°, i.e. about the Eddington number, 
cannot have more than about 30. 
The total number of prime factors may be a good deal larger; thus 108° 

has 160, and 2 = 2* has 

log 

log 2° 
Here there is a big difference between f(n) and F(n), but we shall see that 
this is exceptional. 


3.4. The theorem just proved about f() is a theorem about aif numbers, 
and we are concerned with (in some sense) “‘almost all’. The function 


log n 
log logn 


increases slowly, but by no means so slowly as to explain the facts. We 
find, if we try numbers at random from near the end of the factor tables, 
that f(x) is usually not 7 or 8 but 3 or 4; and we should find, if the tables 
could be extended to the limits of the Eddington number, that it was usually 
not about 30 but about 5 or 6. A number like the Skewes number* would 
generally have about the Eddington number of factors (and that may help 
us to form some image of its size). 
* See Lecture I, p. 17. 
HR 4 
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We need a definition of “almost all”. Suppose that P is a property of 
a number 2 expressed by a proposition P(n); that N(x) is the number of 
numbers, up to a, for which P(n) is false; and that 

N(a) = o(zx). 
Then we say that ‘‘almost all numbers possess the property P”. Roughly, 
the proportion of exceptional 2 is infinitesimal. 

The answer to our problem is that almost all numbers n have about log logn 
prime factors. More exactly, given ¢, then almost all numbers have between 
(1—e) log log n and (1+) log log n prime factors. The theorem is true which- 
ever way we measure the number of factors, and, as we shall see, it can 
be stated still more precisely. 


3.5. The function log log n is suggested in another way. We have 
Lim=y D1l= TIS 
NSe nSa pln PMSx 
the summation being extended over all primes p and positive integers m 
satisfying the inequality; and when we sum with respect to m we obtain 


3.5.1 
( ) Pr m= 317] 
or 
(3.5.2) x f(n) =a PI it O(z), 


since the removal of a square ee me an error of 1 at most. But 


(3.5.3) > ae loglog 2+ 0(1), 
psp 
and therefore 
(3.5.4) > f(a) = vlog loge + O(z). 
nSa 
Also LZFe=DD1= F 1 
nSe ns phin plmsa 


(the summation extending over primes p and positive integers 4 and m), 
and so 


“ooo m0 2[5]-2 (hsb (S}-+) 
meni ee eee 


* p\n means ‘p divides n’, so that 2) implies counting 1 for every prime divisor 


of n. Similarly E, 1 implies counting 1 for every prime or power of a prime which 
pe 
divides n. 
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Hence we have also 
(3.5.7) x F(a) = cloglogz+ O(z). 
NEE 


In particular 
(3.5.8) XS f(y)~eloglogz, Y F(n)~xloglogx 
NSE nSe 


Now if ¢(z) is some simple increasing function, and 
G(2) + 9(8) +... + 9(%) ~ BZ) + 9(3) +... +9(m),7 
then it is natural to say that “the average order of g(n) is ¢(n)”. And since 
log log 2+...-+log log %~n log log n, 
the average order of both f(n) and F(n) is log log n. 


3.6. This is an interesting theorem, but quite different from the one we 
want to prove. We want to prove that f(n) and F(n) are usually about 
log log n. If gin) is log p log log p when x is a prime p, and 0 otherwise, thon 


y g(n) = & log p log log p ~ x log log x,? 
nSx pse 


and the average order of g(n) is still loglogn; but g(n) is usually 0 (since 
‘almost all’’ numbers are composite), so that the normal order of g(n) is 0. 
In our problem the average and the normal orders happen to be the same, 
but that is a peculiarity of the problem. 


3.7. There are two proofs of our theorem, our original proof and another 
given much later by Turan. Turan’s proof is very simple and elegant, and 
I will give it later; but I insert first a skctch of the original proof, which is 
in some ways more suggestive. 

We nced only consider one of the two functions, say f(x). For F(n) 2 f(n) 
and, by (3.5.6), 


& {F(n)—-f(n)}} 


30 -00)~z (5 }S]+-)-o 


for some C. If N(a) is the number of numbers up to x for which 
F(n)—f(n)> G, 


N(@) € 
x G? 


I 


then 


* Or 91) +... +9(n)~G(1) +... 4+¢(n). We start from 2 here because log log” is not 
defined for n = Ll. 


* Since Hx)= DT logp~z 
Sa 


P 
and the factor loglogp adds an additional log log x. 


52 Round numbers 


which is small when @ is large; and the number of numbers for which 
F(n) —f(n) > x(n), where x(n) is any function of n which increases to infinity, 
is o(z). In this sense ‘“ #'(n)—f(n) is almost always bounded’’, 

We confine our attention, then, to f(n). Suppose that w,(z) is the number 
of numbers, not exceeding a, for which f(n) = r. Then 


x 
(3.7.1) (x) ~7(x) ~“Toge’ 
More generally, it is known that 


2 (loglog 
(3.7.2) w,(2) “aa See ar: 


Now 
(8.7.3) [2] = w,(%) + w9(%) +... +0,(Z) +... 
and 
(3.7.4) @ = ——— ¢losloss 148455 ae teem oe +.. ) 
<a logz logz —1)! 
~where 
(3.7.5) € = loglogz; 


and (3.7.2) indicates a certain similarity between corresponding terms of 
the two series. The similarity cannot, of course, be too close, since the first 
series terminates; but corresponding terms of fixed rank are asymptotically 
equivalent when x0. 

The largest term in the series 


& y i) gr-l 
Peet tS Gear 
is that for which r = (£]+1.1 We write (3.7.4) as 
(3.7.6) ae 8 gr-l 2x EElte-1 


loge? I)!” loge“ (E]+n—1)!” 
# assuming both positive and negative values. By Stirling’s formula 
Ei p-2 eé 
(Ele a= 1)!” fare) 


if » is fairly small compared with £ Hence the right-hand side of (3.7.4) 
may be compared with 


(3.7.7) 


pe] 


pee 18 
ee ke Tb 5 se vem 


* There are two equal terms when ¢ is an integer. 


or with 
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and the part of this integral for which tis of higher order than ,/é is negligible. 
Thus practically all of the sum of (3.7.4) is contributed by the terms for 
which # is O(yé). It is natural to suppose that the same must be true of the 
series (3.7.3); that practically all of its sum comes from the terms in which 


|r—£| = |r—loglogz | 
is O(ys) = Ofy/(log log 2}. 
This, as we shall see in a moment, would prove our theorem anda good deal 
more. 

The argument is, as it stands, rough and inconclusive. We have to prove 
that we can neglect most of the terms w,(z) in the series (3.7.3), and for this 
we need inequalities instead of asymptotic equalities. But these can be 
found without much difficulty, and the conclusion is that, if w(x) is any 
function of x such that 

x(«) 
Mog log x) 
then almost all numbers not exceeding x have between 
log log x + x(x) 


prime factors. Since log log x and log log are practically indistinguishable 
over most of the range (1,2),’ it is the same thing to say that almost all 
numbers n have between 

log log n + x(n) 
prime factors. 


3.8. There is one remark to add before I pass to Turan’s proof of the 
theorem. The asymptotic formulae (3.7.2) are corollaries of the Prime 
Number Theorem, but the final proof is ‘‘elementary”’. We have to show 
that the “tails” of the series (3.7.3) are negligible, and for this we use an 
inequality 

x (loglogw+C)-! 


(3.8.1) a aes faye 


where A and ( are independent of both a and r. The proof of this depends 
on Tchebychef’s inequality 
x 


m(x)<A ee 


and does not require the Prime Number Theorem. 


If0<c<1 and z*<n<z, then loglogn lies betweon 


log (e1 =k = 
a Piaioes: g(clogx) = loglogx—~log(1/e) 
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3.9. Turan’s proof of the theorem depends on the identities (3.5.1) and 


(3.9.1) 2 el? = ‘Ged = | |+2 =[; |: 


To prove (8.9.1), we observe that the left-hand side is 


s(xipij- 2 le it B. 
nsx \pin prin pm=p'm' Sa pp", pp Use pmsa 
Here 2, p’ are primes, mand sz any positive integers, and the ranges of sum- 
mation are indicated by the subscripts. Summing the last sums first with 
respect to 4 and m, we obtain the result. 

Now we have seen in §3.5 that 


(3.9.2) ZX f(n) - 2/5 4 = tlog log «+ O(x). 
nSa pga 
x 1 
a | | '- © app’ : Oe *); 
we may drop the restriction p+ p’ because 2 p-? is convergent. But 
(3.9.8) (= 5) 5 5 as(23). 
poveP pp'sa PP pS2P 


since p<./a and yp’ S,/z imply pp’ $2, and pp’ Sz implies p <2 and p’ $2; 
and each of the two extreme terms in (3.9.3) is 


flog log «+ O(1)}? = (log loga)? + O(log log x). 


Hence 
(3.9.4) >i Es = x(log log x)? + O(#log log x); 
pr’ Sa, pep’ | Pp 
and it follows from (3.9.1), (3.9.2) and (3.9.4) that 
(3.9.5) = {f(n)}? = a(log log x)* + O(log log x). 
az 


Finally, writing € for log log a, as in (3.7.5), we have 
(3.9.6) p> Lf) — Ep = 2 fn P26 y {fin}+ y 1 


= ae OE} — 2EafE + O(L)} + Efe + O(L)} 
= O(28), 
by (3.9.2) and (3.9.3), But if 
| f(n)-E] > x(@) 
for more than éx of the n less than ~, then 


>» tn) ~ EP > darx?, 


which contradicts (3.9.6) if y is of higher order than \é; and this proves the 
theorem. 
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3.10. There is a curious corollary about d(x), the number of divisors of 2. 


It is familiar that 
d(1)+d(2)+...+d(n)~nlogn, 


so that the average order of d(n) is log. What is its normal order? 
If n = pops... p2, then 
(3.10.1) fiy=v, Fn) =2a,, d(n) = M(1+4,). 


Also 2e1+as 24 
Hence 

(3.10.2) 2s IT(l+a,) <2*%, 
or 

(3.10.3) 2M < d(n) s 2", 


Since f(n) and F(n) are both usually about log log n, it follows that d(m) is 
usually about 

(3.10.4) Qlog log n a (log n)s? = (log 7)°6---. 

We cannot quite say that “the normal order of d(m) is 2!3108"”, since the 
inequalities which we prove for d(m) are of a much less precise type than 
those which we prove for f(m)," but we can say, more roughly, that the 
normal order of d(m) is ‘about 2!¢!os"”?, 

In this case the normal and average orders do not agree; d(n) is usually 
much below its average order. The explanation is simple; d(n) is too ixregular. 
Most numbers have about 21818” divisors, but some have a very much 
larger number, so much larger that these abnormal numbers dominate the 
average of d(). The irregularities of f(n) and F(m) are not strong enough to 
produce a similar effect. 

It is natural to put the same question about r(n), the number of repre- 
sentations of as a sum of two squares; but in this case the answer is 
immediate. Since 

r(1)+7(2)+...4+7(2) ~70, 
the average order of r(x) is 7, The normal order, on the other hand, is 0, 
since most numbers are not representable.” 


1 Of the type Dioglon R—-XM) << din) < Dloslos n+xXn), 
The normal order of log d(n) is 
log 2 log log n. 
Az 
> Only about =o 
Vlog x) 


of the first 2 numbers are representable. See Lecture IV (B). For the average order, 
see Lecture V, § 5.1. 
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3.11. I conclude by mentioning a conjecture which I made in 1936 and 
which hag since been proved, independently, by Erdés and Pillai. 

The asymptotic formula (3.7.2) is true for every fixed r, and it is natural 
to suppose that it is also true when r is a function of x which tends to infinity 
sufficiently slowly. If it held for 


r = [log log a], 
then a simple application of Stirling’s theorem would give 
1 x : 
and at any rate we might hope to prove that 
Evo ee cae 
A (log log 2) 
for some positive A. The opposite inequality 
Ax 
J(loglog x) 


(for some A) is a trivial corollary of (3.8.1). All that I could prove, however, 
when I made the conjecture, was that 


Az 
(log log x)** 


w,(2) ~ 


(3.11.1) 


@,(a) < 


w,(2) > 


Erdos and Pillai have now proved (3.11.1), and indeed that the inequality 


is true for 
ry 


log log  — Ba/(log log x) <r < log log «+ B y(log log «); 
and Pillai has proved a good deal more, viz. that, if 0<k<e, then 
3, (at) > ie (log - ze ‘i 
for rSk(loglogx—C), 
where C depends only on &, The truth of (3.11.1), for the range of values of 
r stated, is a simple corollary. 


NOTES ON LECTURE Ir 


The main theorem of the lecture was proved by Hardy and Ramanujan, Quarterly 
Journal of Math, 48 (1917), 76-92. This paper is no. 35 of the Papers, no, 32 being a 
preliminary account. 

Turan’s proof, given in § 3.9, was published in his paper 1. We have incorporated 
a simplification suggested by Mr Marshall Hall. The proof is also reproduced in Hardy 
and Wright, § 22.13. Turan (2) has proved several generalisations of the theorem. 
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§ 3.3, The inequalities (3.3.1) and the asymptotic relation. 
A(x) 
log x 


nx}~ 


can be proved by ‘elementary’ reasoning. See for exemple Hardy and Wright, 
ch. 22, and Lecture II. 

For (3.5.3), and sharper results, see Hardy and Wright, ch. 22; Ingham, 22-24; 
Landau, Handbuch, 100-102. The slightly sharper result 


= Z = log log 2 +4+0(1) 
psaP 
(with the proper value of A) is equivalent to Mertens’s theorem 


1 er 
1-=)~ =, 
Pal 5] log 


It is almost as easy to prove the more precise equations 


where y is Euler’s constant. 


z fin) =2tog og 24-42 +0(, 2), z F(a) = # og log # +22 +O z ); 
RSX 


log a}” age log 
1 
where Ba At+S——. 
p(p—1) 


These equations are stated in § 1.2(3) of the joint paper referred to in the first of 
these notes. 

§3.7. For (3.7.2) see Landau, Handbuch, 203-213. 

§ 3.10. The maximum order of d(m) is roughly 

logn 
gioglogn 

This was first proved by Wigert: see Landau, Handbuch, 219-222. Wigert and Landau 
use the Prime Number Theorem in the proof, but Ramanujan (Papers, 85-86) 
showed that this was unnecessary. 

§ 3.11. See p. 5 of my lithographed lectures referred to in the note on § 2.8 (p. 47). 

The proofs of Erdés and Pillai have not yet been published. 
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IV 


SOME MORE PROBLEMS OF THE 
ANALYTIC THEORY OF NUMBERS 


4.1. In this lecture I return to the classical problems of the analytic 
theory of numbers. Its contents are miscellaneous and rather disconnected, 
but have a thread of unity because most of them are suggested by those of 
Ramanujan’s letters. I begin by a digression on a topic about which 
Ramanujan said nothing, but to which I referred in Lecture IT. 


A 
The proof that &(s) has no zeros on o = 1 
4,2. The prime number theorem is “‘equivalent’’ to the theorem that 
(4.2.1) (1+ it) +0, 


in the sense that no deeper properties of {(s) are required for the proof. 
The strict equivalence appears only when we use Wiener’s method, but 
(4.2.1) is essential in any version. The standard proof of (4.2.1) is 
Hadamard’s, but there is an alternative proof, due to Ingham, which 
is interesting in itself and relevant here because it depends on one of 
Ramanujan’s formulae. 

Hadamard’s proof depends on a simple trigonometrical inequality, viz. 


(4.2.2) 3+4cos6+cos 29 = 2(1+cos6)?20 
(for all real @), We use this as follows. By Buler’s formula 


* 
log &(s) = Log 


pags = Fe 4+ fl), 
where f(s) is regular for ¢ > 1 (and indeed for o > 4). Hence 
log | S(o +t) | = pe" + g(a, t) 
= Xp-? cos (plog t)+9(¢,2), 


for ¢ > 1, g(c,t) being bounded, for any fixed t, when g+ 1. 
It follows that, if we write 


X(o,t) = log | F(a) (a -+ tt) Lo +244) |, 
and continue to use g(c, ¢) in the same sense, then 


Xl, t) = Zp-7{8-+ 4cos (tlog p) + cos (2¢ log p)}-+9(a; t); 
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and therefore (since the terms of the series are positive) 


(4.2.8) x(o,8) > ~ Alt), 
where A(é) is independent of o, when o> 1. 

We now fixi. If (1+) = 90, 
then Co +it) = (1 +it+o-1) = (e-1)* (0,2), 
where k is a positive integer and logh(o,t) is bounded when o>1; and 
therefore 

log | (a +it) |< — klog ae + A(t) 
when o> 1. Also log | (a + 2it) | < A(é) 
1 

and log | &(c) | <log— +4, 


where A is constant. Thus 
x(a, t) < (38- 4k)log—++ A(i)>—-o, 


in contradiction to (4.2.3). 


4.3. Ingham’s proof depends on a formula published by Ramanujan in 
1915, viz. 


o,(n) oy(n) _ bs) &(s~a) (sb) 's-a—b) 
(4.3.1) fis) = 258° 2 (2s —a—6) 


where o,(”) is the sum of the a-th powers of the divisors of n, and 
Rs, R(s—a), R(s—b), R(s—a—h) 
are all greater than 1. In particular 


Bn) _ CAs) 
(4.3.2) Be = Fon) 


for o>1. 
To prove (4.3.1) we observe that 
xln) = o4(n) o4(2) 
“is “multiplicative”, i.e. that 
x(n") = x(r) x(n’) 
for coprime n and n’. Hence 
(4.3.3) f(s) = 1 fpls), 
P 
where runs through the primes, 
Co a a 
fylo)= 14 § AP Vee) 


aA=1 
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and s has any value for which the product is absolutely convergent. But 
od giatna | grr — 


= a As 
p(s) Pa pe~—I p— Pp 
a a att a pe - gp % l 
(ply aI) apeet ve pe ap lap 
1 — paths 
ql —p-*) (l — pr) (1 — pos) (l — prte-sy’ 
1 — prto—2s 


sothat  f0) = gayi) 00) 
_ (8) $(s—a) &(s 8) S(s—a—b) 
rien a—b) 
Let us now suppose that 


&(1+ic) = 0, 
for a positive c. If we take a = ic and 6 = ~ic in (4.3.1), we obtain 
| rig(ra) |? _ $2(s) Sle — te) &(8+ te) 
(4.3.4) fst = as) ; 


This formula is valid, in the first instance, for o> 1. But f(s) is regular for 
o> 4, since the double pole of £(s), for s = 1, is cancelled by the zcros of 
¢(s—ic) and f(s+ic); and the coefficients in the series are positive. Hence, 
by a well-known theorem of Landau, the series is convergent, and represents 
}(s), for o > 4, and in particular when s= ‘ +é>4. Thus 


| Cx 
fate) = 21a ale 
for é>0. On the other hand 
€(28) = €(1+ 28) -»00 
when é- 0, and all three factors in the numerator of (4.3.4) are bounded; 


hat 
so tha: LG +8)>0. 
The contradiction shows that £(1+%c)+0 for any positive c. 


B 
The number of numbers which are sums of two squares 
4.4, The assertion which I numbered (1.15) in my introductory lecture was 


“the number of numbers between A and a which are either squares or 
sums of two squares is 


= dt 
EK paaetiioley 
5 Mogi * 2): 


. and (2) is very small compared with the integral”. 


where K = 0-764.. 
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It is immaterial whether we inelude actual squares or not. Ramanujan 
later (i) gave the exact value of K, viz. 


(i) 


where r runs through the primes 4m + 3, and (ii) stated that O(a) is of order 


dices): 


We shall see that this last assertion is false. 


4.5, This problem was solved by Landau ijn 1908. The solution is very 
interesting because it depends on the applieation of the elassical methods 
of prime number theory to a funetion with an algebraical singularity. 

We denote primes 4m+1 and 4m+3 by ¢ and 7 respectively. In order 
that 2 should be a sum of two squares it is necessary and sufficient that 

n= WWyy, 
where 4 is a produet of primes g and v a product of primes r. If we define 
6, a8 1 when n is a sum of two squares, and 0 otherwise, then b, is 1] when 
nm = 21g IDy, 


Be oae Al 1 1 
ond fe) = 253 > Togs Woe Ve 
1 1 1 
Also 9) = j=g3 Uae os 
1 1 1 1 
and 1) tn gt a pe 
so that 
(4.5.1) {fS)P = (8) &(s) L(s), 
where 
1 1 
(4.5.2) (8) = yay Tae 


It is plain that ¥(s) is regular, and has no zeros, for ¢ > 4. As regards the 
other factors in (4.5.1), L(s) is an integral function, whose value for s = 1 
is }7, while {(s) is regular except for its pole at s = 1. It is also known that 
neither ¢(s) nor L(s) vanishes in a region D, stretching to the left of ¢ = 1, 


of type mee oes 
{log ([é] +2)}4° 

Finally, ¢(s) and L(s) are O{(log | t|)4} 

for large ¢ in D. 
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It follows that T(s) = (s—1)* 98), 


where g(s) is regular in D and 
i 


gtd) = Gmbnyy = (3rr7( |)" = Kye. 


4.6. If Bix) = + b, 
NSE 
then B(z) is the number of representable numbers up to w; and 
(4.6.1) B*(z) = yb, = wail, feeds 
4% nse” UATh e400 s 


for ¢> 1.1 We have to treat this integral (or 
some derived integral) as we treated the 
integral for y*(x) in Lecture IT. 

I give a rough sketch of a proof, which may 
be compared with the “first approximation”’ 
(§ 2.5) to the proof of the Prime Number 
Theorem. Therearetwoinportant differences. 
The integral (4.6.1) is much simpler in one 
way, since there is no zeta-function in the 
denominator; but the integrand has an alge- 
braic singularity instead of a pole. Hence 
B*(x) will be approximated not by a residue 
but by a loop integra) round s = 1. We must 
transform the path of integration into a path 
of the type C (Fig. 1), and our approximating 
function will be 


1 
(4.6.2) ml Jee Fig. 1 


= ml axe Kir xe 
= ora aa a9) 4 = tat | peo rphO® 
where 


(4.6.3) As) = 1+4,(s—1)+a,(s—1)?+... 
near s = 1, and Lis the lacet from 1—y round 1 shown in the figure. 
If we treat h(s) as 1, we obtain 
Kf ae Kee (1 e-uloga 
Pr) (hae ~ Yr Jy a 
Ka 
ai(log a)’ 


* ‘The star and dash have the same meanings as in § 2.5. 


which is practically 
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And the argument, when properly developed, will show that 
Kz by Oy 
A (log) ( *Toge” (logay?* fs 
the series being an asymptotic series in Poincaré’s sense. This is effectively 
Landau’s result, though he does not push the analysis so far. 


(4.6.4) Bia) = 


4.7. Now 


= dt Kx 6. B 
7) ) Fog * Jog (tee Tee} 


the series being again asymptotic. Thus Ramanujan’s result is, apart from ' 
the 6(x), of the same form as (4.6.4); but the coefficients in the two series 
are not the same.* Those in (4.6.4) depend, in a rather intricate way, on the 
coefficients @, a, ... in (4.6.3). The integral (4.7.1) has no advantage as an 
approximation over the first term of the series (4.6.4), and Ramanujan’s 
assertion (15), though true as it stands, is definitely misleading. He was 
misled, not unnaturally, by the analogy with the Prime Number Theorem, 
in which the logarithm integral liz is a significant and a particularly good 
approximation. 


It remains true that 


Kz 
PO” Toga 


and it would be very interesting to know just how Ramanujan came to this 
conclusion. It seems clear, from the form of K, that he used the formula 
(4.5.1). The rest of his argument was no doubt highly speculative. 


48. Ramanujan made a similar mistake later in an unpublished manu- 
script which was examined after his death by Miss Stanley, Watson, and 
myself. Ramanujan’s function r(m)* is ‘almost always” divisible by 5. 
More precisely, if t,, = 0 when 7() is divisible by 5, t, = 1 otherwise, and 


T (a) = > ta, 
NEE 


ed 
then P(x ) ~C log a) 


for a certain O. Here again Ramanujan was under the misapprehension 
that . 2 dt 

a (logi)* 
was a much better approximation. 


* In fact J, +a. * See Lecture X. 
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Cc 
A note on ihe Mobius function y(n) 
4.9. Itis well known that certain theorems concerning M6bius’s function 
B(n) are “of the same depth” as the Prime Number Theorem; they are 


equivalent to it in the sense that they can be deduced from it, or it from them, 
in an “elementary”? manner. In particular this is true of the theorems 


(4.9.1) ZH) . 9 
n 
and 
(4.9.2) Mg) cn) = Ae), 


and these two theorems are naturally cquivalent in the same sense. That 
(4.9.2) follows from (4.9.1), indeed, is trivial, since the convergence of 


rin 
n 
implies A, = 4, +4,+...44, = o(n), 
whatever a,. The deduction of (4.9.1) from (4.9.2), while technically ‘“ele- 
mentary”, is much less immediate, and depends upon a rather delicate 
theorem of Axer and the special properties of s(7). 

There are assertions in Ramanujan’s first letter to me which imply his 
familiarity with (4.9.1) and (4.9.2). There is of course no question of his 
possessing a real proof. It is immediate that 

: fln) 4. 1 
ae aC 
and he would hardly have distinguished between this and (4.9.1). 

The assertions are those numbered (2) on p. xxiv of the Papers. If wisa 


number which is the product of an odd number of different prime factors, 
and U(x) the number of such numbers up to a, then 


0, 


3a: 
(4.9.3) U(e)~=5, 
1 9 
4.9.4 = 
( ) Pr Qn? 
1 15 
4.9.5 = 
(4.9.5) Bight 


The last two theorems are easy. We write g for a quadratfrei number, 
u inthe sense just explained, and v for a g which is not a wu. Then 


Hu) =—1, wv)=1 
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and zs et pele) an) 
But Pe is a 

vs 2 nf) =f 
im a fee 


for s> 1, and (4.9.4) and (4.9.5) are particular cases. 
The theorem (4.9.8) lies deeper, and depends on (4.9.2). If we define 
Q(x) and V(x) as we defined U(x), then 
Q(x) = U(a)+ Via) 
and M(x) = V(x)—U(z). 


It is quite easy to prove that 
6a 
Q(z) ay, 3? 


and (4.9.3) then follows from (4.9.2). Thus Ramanujan’s assertion here is 
just of the same ‘‘depth”’ as the Prime Number Theorem. 


NOTES ON LECTURE IV 


§4.2, Hadamard’s proof can be developed so as to show that ¢(s) has no zeros, 

and indeed. 
A 
(2) |>g—- 5, 
189) > Gere aay 
A 

flog (| # | + 2)}4" 
Here the A are appropriate positive constants. See Landau, Handbuch, 169-180. 

Ingham’s proof was published in his paper 2. It can be applied to all Dirichlet’s 
“Z-functions”, and in particular to 

L(s) =1-8 3-94.57... 

but it cannot bo developed in the same way as Hadamard’s. 

§4.3. The formula (4.3.1) is stated in the Papers, 135 (15), There is a proof in 
B. M. Wilson (1). 

For (4.3.3) see, for example, Hardy and Wright, 247-248. 
; Landau’s theorem was published in Math. Annalen, 61 (1905), 527-550. The proof 
is also given in the Handbuch, 697-698. 

§ 4.4. See the notes on p. 10 (printed on p. 20). 
; For Ramanujen’s actual assertions see Papers, xxiv and xxviii, It is plain that, 
if his statement about the order of a) were correct, it would be important whether 
we counted actual squares or not. 


ER 5 


in a region o>l 
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§ 4.5, The properties of €(s) assumed here are included among those proved by 
Landau in the part of the Handbuch reforred to in the note on § 4.2, For the extensions 
to L-functions, and in particular L(s) =1~*—3-*°+..., see Handbuch, 459-464. 


§ 4.6. The argument here is a little less unsophisticated than that of § 2.5, since 
there I assumed the truth of the Riemann Hypothosis, wheras here I assume nothing 
about the zeros of €(s) and L(s) which has not been proved (and have for that reason 
to use a curvilinear contour). In other respects the argument is as rough as in § 2.5, 

§ 4.8. See Stanley (1). The function 7(n) has many curious congruence properties: 
for example r(n) = 0 (mod 691) for almost all n. For theso, see Mordell (1), Watson (23), 
and Lecture X. 

§ 4.9. See Landau, Prac. Matematyczno-Fizycenych, 21 (1910), 97-177 (130-187), 
and the note on § 2.2. 


For the asymptotic formula for Q(z) seo Hardy and Wright, 267-268; Landau, 
Handbuch, 604-609. The theorem is due to Gegonbauer. 


V 
A LATTICE-POINT PROBLEM 


5.1. Suppose that D is a bounded region in the plane of u and », including 
the origin O inside it; and that D(z) is the result of magnifying D about O in 
the linear ratio zt: 1 or the areal ratio x: 1. About how many lattice-points 
(i.e. points with integral coordinates) are there, inside or on the boundary 
of D(x), when « is large? 

The most familiar of such “lattice-point problems” is Gauss’s “circle 
problem”. Here D is the unit circle and D(«) is the circle 


w+uRse. 
It is easy to show that, if N(x) is the number of lattice-points in D(x), then 
(8.1.1) N(#) = 12+ O(a") 


{the area of the circle, with an error of the order of the circumference). 
This theorem is almost intuitive, but is very far from cxpressing the final 
truth, more profound analysis of the problem having shown that the 4 in 
(5.1.1) can be replaced, first by 4, and then by various smaller numbers. 

We can also state the problem in a less geometrical form. If v(m) is the 
number of representations of an integer 7 as the sum of two squares (repre- 
sentations which differ in the order or sign of the bases of the squares being 
reckoned separately), then 7() is the number of lattice-points on the circle 
w+? =n, and 

N(x) = r(1)+7(2)+...+-7([2]) = Dd r(a); 

so that (5.1.1) may be written as os 

(5.1.2) E rn) = ma + Of), 

RSL 


and becomes a theorem about the “average order”’ of the arithmetical 
function r(x). 

There are many generalisations of the problem, to ellipses, and to 
hyperspheres and hyperellipsoids in space of any number of dimensions. 


5.2. Another famous lattice-point problem is ‘“‘Dirichlet’s divisor 
problem”. It is convenient to state this in a slightly different form. The 
cirele of §5.1 was symmetrical in all four quadrants, and we might have 
stated the problem for one quadrant only. If D(a) is now defined by 

“uz0, vz20, wv Sa, 
then N(x) = daa + O(e). 


68 A laitice-point problem 
We could equally well define D(x) by 
u>0, v>0, w+ersa, 
discarding the points on the axes; and we shall see that this modification 


is essential in the divisor problem. 
In the divisor problem D(x) is defined by 


u>Q0, uv>0, uvsz, 


so that N(x) is tho number of lattice-points between the axes and the 
rectangular hyperbola wy = x, counting those, if any, on the hyperbola but 
not those on the axes. There are an infinity of points on the axes, so that it 
is essential to exclude them. 

Tt was proved by Dirichlet that in this case 


(5.2.1) N(x) = cloga+ (2y—1) a+ O(24), 
where y is Euler’s constant. This theorem corresponds to (5.1.1), though 
its proof is not quite so trivial. As in the circle problem, the theorem has 


been refined by modern writers, the results being muchthe same. Alternative 
statements are (i) that 


(5.2.2) X d(n) = wloga+ (2y~—1)a+ O(z*), 


NSX 


d(n) being the number of divisors of n, and (ii) that 


(5.2.3) [a]+ [5 | " [| +o. = alogat (2y—l)a+ O(a). 
The last form is that used in Dirichlet’s proof. 


5.8. I do not know how much Ramanujan had thought about these 
problems in his early days. He was familiar with the dominant terms 


azlogx+ (2y—l)x 
of Dirichlet’s approximation, and had probably found them by an argument 
of the same kind. In this case he should have known (5.2.1). But he was 


unlikely to make any substantial contribution to the subject, since the sting 


of all these problems lies in estimates of error, about which his early ideas 
were quite vague. 


Thus in his first letter to me he says 
“d(1)+d(2)+...4+d(n) = nlogn+ (2y—1) n+ 4d(n)”. 


We should infer (if we took this statement at all strictly) that the order 


of the error in Dirichlet’s formula does not exceed that of d({z]), and in 
particular that 


N(x) = x loge + (2y—1)x+ O(2*) 
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for every positive ¢. This is false (indeed with ¢ ~ 4), though not very easy 
to disprove. It is however much more likely that Ramanujan inserted 
the term 4d(z) merely as a reeognition of a quite sound formal principle. 
When we are investigating the ‘‘sum-function” 


Aw@)= ay 


of an arithmetieal funetion a,, it is usually not A(x) but 
A(z) = X' dy, 
nie 


with the last term a@,,; multiplied by 4 when z is an integer, which presenta 


itself naturally in the analysis.* 
Later in his life, of course, Ramanujan was interested in these problems 
jn a more sophisticated way, though his contributions were not important. 


5.4. Here, however, { am not concerned with either of these elassical 
problems, but with one connected with another of Ramanujan’s assertions. 
This also is in his first letter to me, and runs 

“the number of numbers of the form 2¥3? less than 7 is 

log 2n log 31” 

2log2log3 ° 
The formula is of eourse intended as an approximation, and there is no 
evidenee to show how aecurate Ramanujan supposed it to be. 

Tt will be convenient to write 


qg=logn, w= log2, w’ = log3. 
Then Ramanujan’s assertion is that the number of solutions of the in- 
equalities 


(5.4.1) “20, vZ20, wutwvsy 
2 
s. 66 : ” q q A n 1 
is “approximately Boar + Dot do 3 


The 4 at the end, of course, is not to be taken too seriously; and we shall 
see that, if any constant has a right to stand there, it is not & 

T propose in this lecture to give a short aceount of some of the very eurious 
and interesting analysis which can be made to hang upon Ramanujan’s 
assertion. 


* As in Perron’s formula 
AM(e) = 5 =f oO je) de 
a= ty’ a= 8) -—~ ds, 
Ansa 2 2ii Lat é 
where J(s) = Za,e-An, Seo Hardy and Riesz, The general theory of Dirichlet’s 
series, 12, Here the last term is to be multiplied by Fif zis a A,. 
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5.5. Let us suppose then that w and w’ are any positive numbers, that 
N() = N(y,0,0') 


is the number of solutions of (5.4.1), ic. the number of lattice-points in a 
certain right-angled triangle, that 


SI sa ll tal 
(6) O(n) = Bua! * Bat Ba"? 
and that 
(8.5.2) N(q) = Q(y) + RY). 


The problem is that of finding the best bounds that we can for R(y). Ithas 
been considered, in different forms, by a number of writers, and in particular 
by Hardy and Littlewood and by Ostrowski. Hardy and Littlewood, in 
two papers published in 1921 and 1922, consider it in the form in which it 
is stated here. Ostrowski considers a slightly different problem, super- 
ficially more special but substantially equivalent, and obtains, by different 
methods, very much the same results. 

The problem is not changed materially if we disregard lattice-points on 
the axes. The horizontal and vertical sides of the triangle are /w and 4/w’, 
and the number of lattice-points on them is 


1+[2]+(4] = 14743, 
@ @ oOo w& 


where |j]S1. Hence, if we denote by M(y) the number of lattice-points 
inside the triangle or on its hypotenuse, then 


where r() differs from R(y) by 1 at most. 
It is plain first that 


(5.5.3) Non) = s+ 00) 
(the area of the triangle, with an error at most of the order of its perimeter). 
Beyond this everything depends upon the arithmetic nature of 
d= ww’. 
Since Ny, ,0') = N(ky, ko, ko’), 
it is only the ratio @ that is really relevant to the problem. 
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Rational @ 
3.6. The simplest case is that in which @ is rational. Then (since only 
the ratio of w and w’ is relevant) we may suppose that 
wo=a, wo =), 
where a and 6 are positive coprime integers. 
The number of solutions of 
(5.6.1) au+byv =n” 


is [5 | +€, 


where {is 0 or 1.7 It follows that N(y) has a jump of order y when 9 passes 
through the value 7, so that the equation 


R(q) = o(7) 
is false, R(y) being effectively of order 7. 


5.7. We can calculate N() explicitly as follows. If A, is the sequence of 


numbers 
au + bv (u,v = 0,1, 2, ...) 


? See for example Bachmann, Niedere Zahlentheorie, ii, 129. The proof is simple. 


Byppase: thet n=mad+r (OS7<ab), 


and write u=bU+f, v=aV+a (OSa<a, 0OSf<b); 
so that (5.6.1) becomes 
(5.6.2) mab-+r=(UO+V) ab+ af+ ba, 


Consider the set of ab numbers 
af + ba (OSa<a, 0Sf<b). 

All these numbers are less than 2a and incongruent (mod ab). Hence they can be 
written as 

p, ab+p’, 
where p and p’ together run through 

0, 1,...,a0-L. 

if af +ba = p, 
then (5.6.2) implies U+Veom, 


There is one pair «, f satisfying the first equation, and m-+1 pairs U, V satisfying 
the second, and each set U, V, «, # gives a solution of (5.6.2). On the other hand, if 


af+ba = abt+p’, 
then (6.6.2) gives 0+V=m~1, 


and in this case there are m solutions only. 
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arranged in order of magnitude, and with each A, counted as often as it 
occurs, and 


f(s) = Ler, 

1 Ne ee ete aaa ier, 
hen 7) re, 7 ate fay a 
Here c> 0 and the dash has the meaning explained in $5.3. But 

1 
= —(awtbujs . © ase 
Monee ee=\ aes 
so that 
1 petio en ds 
* a 
tae SA =e (ae) (ie) 5 


We calculate the integral as a sum of residues. The integrand has 
(i) a treble pole at the origin, with residue 


_— 7 9g + dab+d 
ae) PO) = 3957+ 3426+ Ta 
(i) double poles at the points 

§ = Qkri, 


where & is an integer, with residues 


e2kynt ] ae 
Sabai semi ttt ) 
ii) simple poles at the points 
pe p P 


$= ET at by, g= 2A (bk), 
] etkaty phbya 1] etkrity+tayb 
aka sin (kbafa)’ ~ 4k sin (kaz/b) 
respectively: and it is not difficult to show that the integral is equal to 


the sum of all these residues.3 A straightforward calculation then leads to 
the formula 


(5.7.3) N*(q) = P(g) + Q(q) + Taq) + Tol), 
where P(7) is defined by (5.7.2), 


with residues 


4b 
(5.1.4) Qq) = - TE sks (1 Ul- BP 


* By Perron’s formula already quoted on p. 69. 
* aty means ‘a is not a divisor of y’ (the contrary of aly). 
3 We apply Cauchy’s theorem to a rectangle 
c~tP, c+iT, ~E467T, —£~1T7, 
where P 1s so chosen that the horizontal sides of the rectangle do not pass within a 
fixed distance 3 of any pole, and then make T and & tend to infinity. 
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Qhr 
; cos (9 + 38) . cos" (n+ 4a) 
(5.7.5) Ty(9) =-F*" ble ? Pq) = ~geX _ akn 
k sin — ksin— 


the & in the last two series running through all integral values which are 
not multiples of a and 6 respectively. 
Jt is plain that 
Qn) = - 5 (n— tn} 3) +00), 
and 7',(7) and 7’,(y) are periodic (with periods a and b respectively), and so 
bounded. Hence 


(5.1.6) N*() = Py) -% (g-t]- 9) +002) 
for large 7. The second term has a discontinuity 
n 
ab + O(1) 


when 7 passes through an integral value , in accordance with our con- 
clusions in § 5.6. 
Irrational 0 


5.8. The problem is naturally more difficult when @ is irrational. It has 
been proved, in the first instance, that 


(5.8.1) R(q) = 0(7) 


for all irrational 6, so that 
(5.8.2) CO ee ee eee 
200’ 20 20 
is a genuine approximation to N(#); and there are sharper results for special 
classes of 6. These depend upon the nature of the rational approximations 
to 4, or (what is the same thing) on the behaviour of the quoticnts @, in 
the expression 


1 
G = a+ ae a & 
of @ as a continued fraction. If the a, do not increase very rapidly, then 
(5.8.3) Ry) = O(7"); 
for an a between 0 and 1; and if the a, are bounded, then 
(5.8.4) R(q) = O(log 7). 


In particular (5.8.3) is true for all algebraic and (5.8.4) for all quadratic 9. 
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The @ of Ramanujan’s problem is transcendental. Its irrationality is 


trivial, since log 2 tes 


log3 as 


would involve 3¢ = 2°. That it is transcendental follows from the theorem 
of Gelfond and Schneider, that 
ae 


is transcendental whenever a and f are algebraic and / irrational. For 3 and 
39 = 2 
are rational, and therefore ? cannot be algebraic. 
The most that we can prove about R(y), in Ramanujan’s case, is that 


Pay pele 
(5.8.5) R(y) = of jes): 
This I prove in § 5.15. The slightly less precise equation 
nd vi 
(5.8.6) Ri) = Ole -); 


which I prove in § 5.12, is substantially one of Ostrowski’s theorems. 


5.9. There is an “‘identity’’ for N*(y), when @ is irrational, which is 
similar in form to (5.7.3) but involves series which are not convergent in 
the ordinary sense. I shall not use this, but I shall give,a proof of (5.8.1) 
and (5.8.6) which is due substantially to Heilbronn. Heilbronn’ 8 argument 
is the same in principle as Ostrowski’s, but simpler. It is effective when we 
have to prove R(y) “‘only a little less than 7’, as it is in (5.8.1) and (5.8.6), 
but does not Jead, in this simple form, to the more precise results which, 
like (5.8.4), are true only for very “simple” 0. 


We define {2} by {x} = #—[a]—-4. 
This function has the analytic representation 


gin dare 
5 one A 


{a} = — 2 (sin Qa + 

n 

but we shall not use this here. 
The ordinate u = ” cuts the hypotenuse of the triangle in the point 


y-—nw 
bi , 2 


w 
and the number of lattice-points on it is 


1+[7-"|. 
@ 
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nw me 
Hence (7) z (1 +[25 |)- B (2S +5) x 
oe ngqjo\ n&qlo 


If i. [Z| +f, 


then the first sum is 


(e813 
GHB) EB Abe Aeedooteoo 


Hence 


n-ne 
w f° 


(6.9.1) Mn) = 5 es 5 = 2(q)— Sq) + (1), 
where 
(6.9.2) - S)= E, (nol 
and the proof of (5.8.1) is reduced to a proof that 
(5.9.3) S() = o(7). 
Ostrowski, in his paper referred to on p. 70, is concerned with the sum 
2) 


a series of much the same type as (5.9.2), and his arguments may be applied 
with equal effect to (5.9.2). 


5.10. If P _ Pm 
Y In 
is a convergent to the continued fraction for 0, then q,, tends to infinity 
with m and a 
9-2-2 {7 | 
q Gm Ym Imt1 
where Yn+1 = On 419m t+ Gmn-1 


and @,+1 is the complete quotient corresponding to a,,,4. We suppose 
that ¢<y/w and 


[2] =rq+s (r21,0Ss<q). 


wo 
Then 
(5.10.1) S(q) = oy (2 -no| + O(s) = S*(q)+ O(s) 
If we write 


N= Uge+v (w= 0,1,...,r-1; v= 0,1,...,¢-1), 
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then 
t~1lq-1 q wl 
(5.10.2) Sa)= > > (J-wa+n9) = & S,(7), 
#=0v=0 |W n=O 
where 


q-1 
(5.10.3) Sq) = py {@-v0}, a= Oy = 4 — 190. 


We prove in the next section that 
(5.10.4) 8,(n) = O(1), 


uniformly in «. If we assume this for a moment, then (5.10.1), (5.10.2), 
and (5.10.4) will give 


(5.10.5) S(7) = O(r) + O(s) = of 2) + O(Gm). 


Proof that SA) ts bounded 
5.11. Let us Suppose, to fix our ideas, that m is even. Then 


o<p—2<1 
q ¢ 
Also e= Ena 3 
q 
where a@ is an integer and 0x6 <7 
Thus a SOP s-(9-2) 
q q 
is less than I/q and greater than — 1/q, so that 
(5.11.1) a9 22 is 
| q q 


If m is odd, so that 6 is less than p/g, then we define a by 


te ag (ose<2), 
q q 


and (5.11.1) is then still true. Hence in any case 


(5.11.2) a—yo, 2’ 
g 


differ by less than I/q. But (p,q) = 1, and therefore 
a-— 
q 


ay +7,, 
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where i, is an integer and 7, runs, in some order, through the values 
ne ee 
qq q 
The integral parts of the numbers (5.11.2) can differ only if there is an 
integer between the numbers, and this can happen only if r, = 0, and 
therefore once only. In this case the difference 


£36 = |2a2 
[a—v0]} “a 


is 0 or —1. We call this difference e. 
It follows that 


qe _ 1 Ta—vp 
x [a—v] = = [= |+ 
wta-vp 1 2 g~-1 
7 9 @g 4 
=a—4(p+1)(q-1) +6, 
qa-1 q-1 qo 
8,(g) = x {a —v0} = x (a—vO—4)— x [a— v4] 
= o(a—4)—99(g— 1)0-a+3(p+ 1) (g-1)-e 


= ga—a—3a(q—1) (0-2)-1-< 


+€ 


1 
| S,(@)| $1 95|+4¢. Gth+1<3. 
This completes the proof of (5.10.4) and so of (5.10.5). 


5.12, We have thus 
S(n) = of 2) +0(4y) 


mM, 


Given any positive 6, and any sufficiently large 4, we can choose m so that 


1 
3<In< on, 
plate Qn < 269. 
Im 
Hence S(7) = o(4) for allirrational 0; and this, as we have seen, is equivalent 
to (5.8.1). 
Suppose next that @ has Ramanujan’s value. Then 
(5.12.1) j2¢—3P | >] 


for all p,g. This is | 1—eP 08 3-9 10g 2 | > 2-2, 
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from which it follows that 

(5.12.2) |g@-—p|>A2-2 
for a constant A and all gq. 

If » and g are p,, and g,,, then 


1 
YmO ~ Pin | < > 
Vm+1 


and so Grn < AQhe < eam, 


We can choose m so that 


Im s Te <Im+4t° 


4 
Then eA > nya > ing 


Ym > A log 4, 


and this proves (5.8.6). 
Proof of (5.8.5) 


5.13. It remains to prove (5.8.5). This is a very small improvement on 
(5.8.6), and no doubt says very much less than the final truth, so that it 
may seem odd to insist on it. But the proof depends on a very interesting 
theorem of Pillai, and it is for the sake of this theorem that I include it. 

We write down the sequence 


2, 3, 4, 8, 9, 16, 27, 32, 64, 81,..., 


formed by the powers of 2 and 3 arranged in order of magnitude, and call 
the n-th term of the sequence u,,. Then Pillai’s theorem asserts that uy44— Uy 
the n-th gap in the series, tends to infinity nearly as rapidly as u,. More 
precisely, given any positive d, 


| 27-84 | > Q0-ae 


for all integral x and y with z>2x,(d). 
It follows in particular that the equation 


(5.18.1) 22 3Y =f 


can have only a finite number of integral solutions for any given k. This is 
easier to prove. A very well-known theorem of Thue says that 


(5.13.2) AX?— BY? =] 


* The various A’s are naturally not the same. 
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(with any integral A, B,1) has only a finite number of solutions. If we 


now put 
z= 3E+p, y= 3n+0 (p,0 = 0,1, 2), 


we obtain nine equations 

20(98)9 ~ 3°(37)8 = i 
of the form (5.13.2). Hach of these has only a finite number of solutions, 
and so therefore has (5.13.1). 


5.14, Pillai proves, more generally, that if m,n,a,b are given positive 


integers, an®—bnv 20 


for any integral x and y, and 8 is positive, then 
| am? —bn¥ | >mG-he 

for all x >xo(5). Here x,(6), of course, depends on m,n, a and } as well as on d. 
Similarly, a K which appears in the argument may depend on m, n, a and b, 
as well as on any parameter indicated specially. 

We use one deep theorem of Siegel: if § is an algebraic number of degree r, 
then there is an A(t), depending only on £, such that 

eee 
| fo P | x (g) 
q 


gr 


for all integral » and q. There is a familiar theorem of Liouville in which 
7 stands in the place of 2rt, Thue showed that 7 could be replaced by any 
number greater than }r+1, but neither Liouville’s nor Thue’s theorems 
would be strong enough for Pillai’s application. What is essential is to have 
an exponent of lower order of magnitude than 7. 

Suppose now that u and v are positive integers, that a/b is not a perfect 
r-th power, and that jaut < bu" <au’. 

Ur 

If w= (5) = aU, 


then a is an algebraic number of degree 7 at most, and 
aul — but = b(u" ~ vo") > bro’ '(w —v) 


> K(r)ul(w—v) = Kir) w{a -2) . 
Hence 
(5.14.1) aut — bu" > K (yur, 


by Siegel’s theorem. It is obvious that this is also true if 0 <bo"< dau", so 
that it holds whenever au’ — bv" is positive, Similarly 


(5.14.2) bef — au" > K (rr 


80 A lattice-point problem 


whenever it is positive, and a moment’s consideration shows that we can 
unite (5.14.1) and (5.14.2) in the form 

(5.14.3) | aut — bv" | > K(r) a7? 
uw and v being any positive integers, and z being w or v at our discretion? 
Here K(r) of course depends on a and 6 as well as on ¢. 

We can now prove Pillai’s theorem. We take é small and positive and 


16 
r= 
and write weorth (OSh<r), y=ir+l (OSI<74), 
wom, ven. 
Then lam? —bn¥ | = |am*.ur—bn!.o" | > K(r, h, ut, 
by (5.14.3), and so |am™—bn¥ | > K(6) ater 


since the number of values of 4 and / concerned depends only on r, and 
r only on 36. 


Also ut = me) > K(d) m*, 
urrert  yfl-tr s K(8) mois; 
and so | am* — ba? | > K(8) mabe, 


From this it follows that 
| am? — bn | > mae 


for w>,(6), and this is Pillai’s theorem. 
5.15. Pillai’s theorem is just sufficient to improve (5.8.6) into (5.8.5). 
Returning to the argument of § 5.12, we can replace (5.12.1) by 
| 29 37 | 2B Qatr—a) 
(for any positive 6 and sufficiently large p,q); and (5.12.2) by 
| gO | > 2-84 = ee, 
where ¢ = dlog 2. It follows that 


Gma < em 


for sufficiently large m. We choose m so that 
26H 
Gm s logy < Qm+1 
1 1 1 
Then Gin > qs Imir> = (log 7 — log log 9 + log 2c) > oe log 9 


* Since a/b is not an r-th power, au"—bv" cannot be 0. If it is of smaller order than 
uf or v', then wv and v are of the same order. 
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for large 7 and m, and 


7 eee 
Int <jogn’ ri 
pape (pee) 
so that S(y) = ve | : 


We have thus replaced the O of (5.8.6) by o. Jt may seem rathor dis- 
appointing that the use of sueh powerful weapons should lead to so small 
an improvement in the final result, but such disappointmonts aro common 
in this kind of analysis. 


NOTES ON LECTURE V 


§ 5.1. There are two proofs of (5.1.1) in Hardy and Wright, 268-269. 
The essential difficulty of the circle problem is that of determining ©, the smallest 
value of £ such that N(x) = 10+ O(a8*¢) 


for every positive e. It follows from (5.1.1) that OS}. Sierpinski proved in 1906 
that OS}, van der Corput in 1923 that O<4, Littlewood and Walfisz in 1924 that 
O<s35. In the other direction, Hardy and Landau proved independontly in 1915 
that O24. There is a profound study of the problem, up to this stage, in Landau, 
Vorlesungen, ii, 183-308, 

The results have been improved ance by Nieland, Titchmarsh, and Vinogradov. 
The best result is Vinogradov’s @ $44. 

Fullor references will be found in Bohr and Cramér, Enzyhl. d. Math. Wiss. uo 8 
(1922), 823-824, and in two papers by Titchmarsh, Quarterly Journal of Math. 
(Oxford), 2 (1931), 161-173 and Proc. London Math. Soc. (2), 38 (1935), 96-115 and 
555. The paper by Littlewood and Walfisz is in Proc. Royal Soc. (A), 106 (1924), 
478-488, 

§ 5.2. For Dirichlet’s proof of (5.2.1) see, for oxample, Hardy and Wright, 
262-263. 

The history of this problem is very similar to that of the circle problem, though 
recent writers have tended to concentrate on the latter. In this problem @ is the 
smallest £ for which 

N(x) = x log e+ (2y—1) x + Olaé+s), 
It follows from (5.2.1) that 94. Voronoi proved in 1903 that @<4, Hardy and 
Landau in 1916 that @2=4, and van der Corput in 1922 that @<8%. For fuller 
references see Bohr and Cramér’s article quoted above, 815-822. 

The best known result here seems to be @ $43, proved by van der Corput in e later 
paper in Math. Annalen, 98 (1928), 697-717. 


§ 5.3. One example is the formula 
2 rin) r(n) 
1 a. en trvi{int+a)} 
) nba) nae +6) 


Here @ and b are positive and r(n) has the meaning of §6.1. 
It is easy to prove (1) by writing the series on the left as 


en8aV{intd)a}, 


5 
gr te—r2blef Sr-() em n? ai ete n W/E G2fo 
pa {2r(n) iS (x) af’ 
where Hx) = 1+ ee Qentme |, 


HR 6 
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and using the functional equation 


T ‘) 
Hx) = J (?) (5). 
a) \e 
There is @ generalisation for the ellipsoid, which I quote in the Quarterly Journal 
of Math, 46 (1915), 283. 
The formula remains valid so long as a and 4/b havo positive real parts. If we put 


a= xe, whoro 0<0<z7, and 2 is positive and non-integral, make 67, take the 
imaginary parts, and thon put b = 0, wo obtain 


r(n) 2 rn), 
; = Qaje+ > sin {274/(nx)}. 
y d<n<a(a—n) ve tn {2aa/(nee)} 
This is also the result of putting «= ~4% in a formula for Z(z—n)*r(n) which I 


proved (for « >0) in a paper in Proc. London Math. Soc. (2), 15 (1916), 192-213 (205). 
Neither of these deductions, of course, is a proof of (2), and I do not know that 
there is any proof standing in the literature. The formula is of the same type as the 
identity © a(n) 
r 

XY’ rn) = ae+.e25—— 
Osnsz ¥ 1 a 

(first proved in my paper in the Quarterly Journal quoted above). 
I imagine that the series on the right of (2) is summable by Cesaro or Rieszian 

means of any positive order, so long as x is not an intoger. 


$5.4. Papers, xxiv (3). 


§ 5.5. The principal papers by Hardy and Littlewood are in Proc. London Math. 
Soc. (2), 20 (1922), 15-36 and Abh. math. Semin. Hamburg, 1 (1922), 212-249; and 
that by Ostrowski in Abh. math. Semin. Hamburg, 1 (1922), 77-98 and 250-251. 
There is an account of these problems, with an exhaustive bibliography, in Koksma, 
“Diophantische Approximationen”’, Ergebnisse der Math. rv 4 (1936), Kap. rx. 


§ 5.8. The problem of proving the transcendentality of a4, under the conditions 
stated, was the seventh of the problems proposed by Hilbert in his address “‘Mathe- 
matische Probleme” to the seventh international congress of mathematicians in 
Paris in 1900. This address was published in German in Gottinger Nachrichten (1900), 
253-297, and in French, under the title “Sur los problémes futurs des mathé- 
watiques”’, in tho official report of the congress (Paris, 1902). See Koksma, J.c. supra, 


Kap. rv, especially pp. 64-65, where there are references to the papers of Gelfond 
and Schneider. 


§ 5.9. For the “identity” see Theorem 4 of the second of the papers by Hardy 
and Littlewood referred to under § 5.5. 


Dr Heilbronn conmunicated his proof to me personally. 
§ 5.10. The notation for continued fractions is that of Hardy and Wright, ch. x. 


§§ 5.13-14. Pillai’s theorem was proved in Journal Indian Math. Soc. 19 (1931), 
1-11. 

Pélya, Math. Zeitschrift, 1 (1918), 143-148, proved a general theorem from which 
it follows, as a very particular case, that (5.13.1) has only a finite number of solutions. 
Herschfeld, Bull. Amer. Math. Soc. 42 (1936), 231-234, has proved that there is at 
most one solution when & is sufficiently large. There are further results of this kind 
in another paper by Pillai, Jowrnal Indian Math. Soc. (2), 2 (1936), 119-122 and 215. 

For Thue’s and Siegel’s theorems see Landau, Vorlesungen, ii, 37-56. Siegel’s 


theorem is Satz 691. It is stated there in a slightly different form, and for rzZ 3, 
but it is trivial when r = 2. 


J,{2nj(nz)} 


VI 
RAMANUJAN’S WORK ON PARTITIONS 


6.1. A partition of nis a division of n into any number of positive integral 


parts. Thus 
4= 384+] =242=241]4]=14+14141 


has § partitions. The order in which the parts are arranged is irrelevant, so 
that we may think of them, if we please, as arranged in descending order. 
We denote the number of partitions of by p(”); thus p(1) = land p(4) = 5. 
It is convenient to define (0) as 1. 

A partition of n may be represented graphically by an array of dots or 
“nodes”. Thus 


represents the partition 6+3+3+2-+1 of 15. We could also read the graph 
vertically, when it would represent the partition 5+4+3+1+1+1. Two 
partitions so related are called conjugate. 

The largest part in either of these partitions is equal to the number of 
parts in the other. Generally, a graph with m rows represents, when read 
horizontally, a partition into m parts, while read vertically it represents a 
partition into parts the largest of which is m. It follows that the number 
of partitions of 2 into m parts is equal to the number of partitions into parts 
of which the largest is m; and that the number of partitions into at most 
m parts is equal to the number of partitions into parts which do not exceed m. 


6.2. There are many less obvious theorems about partitions which can 
be proved by a direct study of their graphs. I choose as an example 
F., Franklin’s beautiful proof of a famous identity of Euler. 

Euler’s identity is 

(6.2.1) (1-2) (1-2?) (1-29)... = l—-w—2?4+28427-.... 

The right-hand side is 


1+ > (= 1) fad ORD 4 ggARRID) S (—1)k atlas, 
ka1 


k=—o 
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It may also be written as 1 + Lent, 


where ¢, is 0 unless x = $4(3h + 1), and then (—1)*. 
Let us write (1—a) (1-2?) (1-9)... = Xy(n) 2”, 


and multiply out the product so as to obtain an arithmetical interpretation 
of the coefficient y(7). There is a term in x” corresponding to every partition 
of n into unequal parts: thus the partition 6 = 3+2+1 gives rise to a 
term (—1)%2°. Generally, a partition of n into wz unequal parts contributes 
(— 1} to y(n), 80 that 
y(n) = p.m) — Pol); 

where ,(n) and p,(n) are the numbers of partitions of n into an even and 
an odd number of unequal parts. We try to establish, as far as we can, 
a one-to-one correspondence between partitions of these two types. The 
correspondence cannot be complete, since a complete correspondence 
would show that »,(n) = »,(n) and y(n) = 0 for every 7. 

We take the graph G, which represents (read horizontally) any partition 
of » into any number of unequal parts, in descending order. We call 


(4) 


4B 

the lowest line AB the base # of the graph. From the extreme north-east 
node, we draw the longest south-westerly line possible in the graph; it 
might of course contain one node only. This line CDZ we call the slope 7 
of the graph. We write #<o when (as in graph G,) there are more nodes 
in o than in #, and use a similar notation in other cases. Then there are 
three possibilities. 

(i) B<o. We move f into a position parallel to and outside o, as shown 
in graph G,. This gives a new partition into decreasing unequal parts, and 
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into a number of such parts whose parity is opposite to that of the number 
in G,. 

We call this operation O, and the converse operation (removing o and 
placing it below f) 2. Itis plain that 2 is not possible, when £ < o, without 
violating the conditions of the graph. 

(ii) 8 = oc. In this case O is possible (as in graph G,) unless # meets 
(as in graph G,), when it is impossible. is not possible in either case. 

(iii) P > o. In this case O is always impossible. Qis possible (as in graph G;) 
unless 8 meets o and f= ¢+1 (as in graph G,). 2 is impossible in the 
last case because it would lead to a partition with two equal parts. 


To sum up: there is a (1,1) correspondence between the two types of 
partitions except in the cases exemplified by (G,) and (Gg). In the first of 
these exceptional cases n is of the form 

k+(e+1)+(k+2)+...4+(2k—1) = 4(3h2—8), 
and in this case there is an excess of one even or one odd partition according 
as k is even or odd. In the second case n is of the form 


(B41) + (b+ 2)+(443)+...42h = 4(32+4), 


and the excess is the same. Hence y(n) is 0 unless m = 4(342+4%), when 
y(n) = (—1)*. That is to say, y(m) = ¢,, and this is Euler’s theorem. 


6.3. Franklin’s proof is a very striking example of what may be done 
by elementary “combinatorial” arguments; but most of the theory of 
partitions requires a more analytical setting. 

The analytical theory was founded by Euler, and rests, like the analytical 
theory of primes, on the idea of a generating function, But the generating 
functions of the theory of partitions are power-series 


F(x) = Zf(n)2. 
The function F(n) is called the generating function of f(n), and is also said 
to enumerate f(n). 
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It is easy to find the generating function of p(n). This is the function 


1 


EG aaah ae) on 


(a function fundamental in the theory of elliptic functions). In fact, 
expanding each factor of F(x) by the binomial theorem, we have 


P(e) = (l4+auta?+a +...) (lta tatiats...) (L438 ta8+aot...)..., 


and a moment’s consideration shows that every partition of contributes 
just 1 to the coefficient of x”. Hence 


F(x) = Lp(n) a. 


It is equally easy to find the generating functions which enumerate 
partitions of into parts restricted in various ways. Thus 


1 
(—a)Q—25) (1-25)... 


enumerates partitions into odd parts; 

(1+a) (1+?) (1+29)... 
partitions into unequal parts; and 

(14+) (1+23)(1+25)... 
partitions into odd and unequal parts. Euler’s product 

(1-2) (1-2) (1a)... 


enumerates the function y(n) of § 6.2, the excess of the number of partitions 
of m into an even number of unequal parts over that of its partitions into 
an odd number, 


Similarly ([ take examples which I shall want to refer to later) 


1 
(1-2) (1~2?)... (1-2) 


enumerates partitions into parts not exceeding m or (what we have seen 
to be equivalent) into at most m parts; 


a¥ 
(1—2) (1-4)... @—2") 


enumerates the number of partitions of n ~N into at most m parts; and 


a 
(1-2?) (1~a*) ... (l—a?") 


Ramanujan’s work on partitions 87 
the number of partitions of n—N into at most m even parts, or of }(n—V) 
into at most m parts of any kind. Finally 

1 
(1—2) (1 —a4) 1-25) (1—2)...’ 
where the indices of x are the numbers 5m +1 and 5m +4, enumerates the 
partitions of x into parts of these two forms. 


Ramanujan’s congruences 


6.4. Very little is known about the arithmetical properties of p(n); we 
do not know, for example, when p(m) is odd or even. Ramanujan was the 
first, and up to now the only, mathematician to discover any such properties; 
and his theorems were discovered, in the first instance, by observation. 
MacMahon had calculated, for other purposes to which I shall refer later, 
a table of p(x) for the first 200 values of n, and Ramanujan observed that 
the table indicated certain simple congruence properties of p(n). In 
particular, the numbers of the partitions of numbers 5m+4, 7m+5, and 
11m +6 are divisible by 5, 7 and 11 respectively: ie. 


(6.4.1) p(Sm+4)=0 (mod 5), 
(6.4.2) p(im+5)=0 (mod 7), 
(6.4.3) p(llm+6)=0 (mod 11). 


Thus p(4) = 5 and (5) = 7. 


6.5. Ramanujan found comparatively simple proofs of (6.4.1) and 
(6.4.2). These depend on two formulae which belong properly to the theory 
of elliptic functions, Euler’s formula (6.2.1) and Jacobi’s formula 


(6.5.1) {(1—2) (1—a) (1-23) ...}8 = 1-304 528-728 +..., 


where the indices on the right are the triangular numbers $4(4+ 1). We have 
proved (6.2.1), but there is no equally simple proof of (6.5.1). We may also 
write (6.5.1) as 


{(1 a) (1-22) (1—a9),...}8 = £5 (— 1)! (2k+ aden, 


Ramanujan now argues as follows. We have 
w{(1—a) (1—a?)...}4 = af(l—2) (1—2*)...}.{(1—2) (1-22)...}8 
= @(1—-a—a?+a5+...)(1—3a+ 505 7408+...), 
by (6.2.1) and (6.5.1). We write this as 
(6.5.2)  a{(1—a) (1 — 2?) ...}4 = $2 0(— 1)ete(2y 1) gh taeeet Dt ety 
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both # and vy running from —0o to 00, and we consider in what circumstances 
the index of a is divisible by 5. This demands that 
O(u+ 1)P+ (2v-+ 1)? = BE +4 + 1) + Ho + 1)}—lop2—5 
shall also be a multiple of 5. Now 
2(u@+1)?=0,2,0or3 (mod 5) 
and (2v+1)?=0,lor4 (mod 5), 
as we can see by enumerating the possible cases; and the sum of one residue 


from each set can be 0 or 5 only if each is 0. Hence, if the index in (6.5.2) 
is a multiple of 5, the coefficient 2y+ 1 is also a multiple of 5, and therefore 


ihe coefficient of 2°™+5 in af(1—2) (1—2) ...}4 
is a multiple of 5. 
Next, in the binomial expansion of 
1 
(1-2)? 
all the coefficients are divisible by 5 except those of 1, x5, x1, ..., which have 
residue 1 (mod 5), That is to say 


1 1 
fa) 1 a (mod 5) 
5 
or aay"! (mod 5). 


Hence the coefficient of 25”+5 in 


s (1—2#5) (1 ~a1°) ,., 
(1—2)(1—22)... 


is a multiple of 5, and so therefore is that in 


1—#4)(1—4)),.. 
= (2) (1-84) ogy daa oP 


x 
(I1—2) (1-2) (1—a8)...° 
and this coefficient is p(5m +4). 
We can prove (6.4.2) similarly. In this case we write 
a{(1 —a) (1—w?) ...38 = 2?(1 — 8a + 543 —7a8+..,)? 
= 425( — 192 +1) (Qy +1) eet thoy), 


and observe that 
(24 +1) + (241)? = 82+ du(at1)+4r(vt D}- 14 


can be divisible by 7 only if 24+1 and 2v+1 are both divisible by 7. The 
proof may then be completed on the same lines as before. There does not 
seem to be any equally simple proof of (6.4.3). 


* Corresponding coefficients are congruent (mod 5). 
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6.6. Ramanujan went a good deal further. He proved congruences with 
moduli 5, 7? and 11%, that for 5° being 


p(25m+24)=0 (mod 5), 


and put forward a general conjecture: if 


é = §27911¢ 
and 24A=1 (mod 4), 
then p(md+A)=0 (mod 3) 


for every m. It would be sufficient to prove the congruence for the special 
moduli 5%, 7° and 11°, the general congruence being a corollary. 

This conjecture has led to a good deal of work, and it has been found 
that Ramanujan generalised too far. Gupta, extending MacMahon’s 
calculations of p(n) up to 2 = 300, found that 

(243) = 133978259344888, 
a number not divisible by 75; and 8. Chowla observed that, since 
24,243=1 (mod 73), 


this contradicts Ramanujan’s conjecture. On the other hand Kre¢mar has 
proved the congruence 

p(125m+99)=0 (mod 5%), 
and Watson the congruence for general 5%; and D. H. Lehmer has verified 
the conjecture, in certain special cases, for 118 and 114. Lehmer’s work 


involves the calculation of some particular very large values of p(n) by 
a method which I shall explain in Lecture VIII: the largest is 


(6.6.1) (14031) = 92 85303 04759 09931 69434 85156 67127 75089 
29160 56358 46500 54568 28164 58081 50403 
46756 75123 95895 59113 47418 88383 22063 
43272 91599 91345 00745, 


a number divisible by 114. 


6.7. There is another proof of (6.4.1) which is much more difficult than 
the one I quoted, but which says much more and led Ramanujan much 
deeper into the theory of elliptic modular functions. In the same paper 
in which he proved (6.4.1) and (6.4.2), Ramanujan stated without proof 
the two remarkable identities 

GID pase episiars,,.c ieee USS 

ate p p(9) e+ p14) a? +... tas Ee Hie x3), .-}8 
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and 


‘ _ a{(l—2#") (1 ~2#44) (1 - oe 8 
(6.7.2) »(5)+9(12)e+p(17)2?+...=7 {i —a) (1-27) (1a)... 
{(1—27) (1 — a4) ( a ay 
{(1—2) (1-2?) (1-2) 38° 
These make (6.4.1) and (6.4.2) intuitive, and also provide proofs of the 
congruences to moduli 5° and 7°, Thus, if we assume (6.7.1), we have 


+ 49% 


pl4)a+p(9)x?+... x ee x)... 
5{(1 — a) (1—al0) 1.34 (1-2) (1-24)... (1-2) (1-2?) ..8 
© F 
“eae 


Hence (after what we have proved already) the coefficient of «*"+> on the 
left-hand side is a multiple of 5; and from this it follows that 


p(25m+ 24)=0 (mod 5). 
Similarly (6.7.2) leads to 
p(49m+47)=0 (mod 7). 


Ramanujan never published a complete proof of (6.7.1) or (6.7.2); but 
proofs have been found by Darling and Mordell. 


The Rogers-Ramanujan identities 
6.8. I come next to two formulae, the ‘‘Rogers-Ramanujan identities”, 
in which Ramanujan had been anticipated by a much less famous mathe- 
matician, but which are certainly as remarkable as any which even he ever 
wrote down. 
The Rogers-Ramanujan identities are 


x gt gn 


C80 oa Gaya aay 
1 
~ (12) (1—#)... (1-2 (1-2)... 
and 
x id ; gemtm+1) 
O82) “me aaa Ora a 
1 


~ (=a?) (1-27)... d—a) (1-2)... 
The exponents in the denominators on the right form in each case two 


arithmetical progressions with the difference 5. This is the surprise of the 
formulae; the “basic series” on the left are of a comparatively familiar type. 
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The formulae have a very curious history. They were found first in 1894 
by Rogers, a mathematician of great talent but comparatively little 
reputation, now remembered mainly from Ramanujan’s rediscovery of his 
work. Rogers was a fine analyst, whose gifts were, on a smaller scale, not 
unlike Ramanujan’s; but no one paid much attention to anything he did, and 
the particular paper im which he proved the formulae was quite neglected. 

Ramanujan rediscovered the formulae sometime before 1913. He had 
then no proof (and knew that he had none), and none of the mathematicians 
to whom I communicated the formulae could find one. They are therefore 
stated without proof in the second volume of MacMahon’s Combinatory 
analysis. 

The mystery was solved, trebly, in 1917. In that year Ramanujan, 
looking through old volumes of the Proceedings of the London Mathe- 
matical Society, came accidentally across Rogers’s paper. I can remember 
very well his surprise, and the admiration which he expressed for Rogers’s 
work, A correspondence followed in the course of which Rogers was led to 
a considerable simplification of his original proof. About the same time 
I. Schur, who was then cut off from England by the war, rediscovered the 
identities again. Schur published two proofs, one of which is ‘‘com- 
binatorial” and quite unlike any other proof known. There are now seven 
published proofs, the four referred to already, the two much simpler proofs 
found later by Rogers and Ramanujan and published in the Papers, and 
a much later proof by Watson based on quite different ideas. None of these 
proofs can be called both “simple” and “‘straightforward”, since the 
simplest are essentially verifications; and no doubt it would be unreasonable 
to expect a really easy proof. 


6.9. MacMahon and Schur showed that the theorems have a simple 
combinatorial interpretation. I take the first. We can exhibit a square m? as 
14+3+5+...+(2m—1), 


or in the manner shown by the black dots of (G,). If we now take any 
partition of x — 2m? into m parts at most, with the parts in descending order, 


. . . ry . . . ° ° Q ° 
. . . . . o ° ° 

(@) 
. . . oO i) 3° 


and add it to the graph, as shown by the circles of (G,), where m = 4 and 
n= 42.111 = 27, we obtain a partition of n (here 


27 = 114+8+6+2) 
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into parts without repetitions or sequences,’ or parts whose minimal 
difference is 2, The partitions of » of this type, associated with a particular 


m, are enumerated by 
2 
gm 


(1—2) (1—2?)...(1—amy’ 


the general term of the scries on the left in (6.8.1); and the whole series 
enumerates all such partitions of n. 

On the other hand the right-hand side enumerates partitions into numbers 
5m+1 and 5m+4. Hence (6.8.1) may be restated as a “combinatorial” 
theorem: the number of partitions of n with minimal difference 2 is equal to 
the number of partitions into parts 5m+1 and 5m+4, Thus when n= 9 
there are 5 partitions of each type; 


9, 811, 742, 643, 54341 
of the first kind, and 


9, 6414141, 44441, 44141414141, 
1+14+1414+141414141 


of the second. There is a similar combinatorial interpretation of (6.8.2). 
These forms of the theorems are MacMahon’s (or Schur’s); neither Rogers 
nor Ramanujan ever considered their combinatorial aspect. It is natural 
to ask for a proof in which we set up, by “combinatorial” arguments, a 
direct correspondence between the two sets of partitions, but no such proof 
is known. Schur’s ‘combinatorial’ proof is based, not on (6.8.1) itself, 
but on a transformation of the formula which J will mention in a moment.’ 
It is not unlike Franklin’s proof of (6.2.1), but a good deal more complicated. 


6.10. The proofs given ultimately by Rogers and Ramanujan are much 
the same, but Rogers’s form is a little easier to follow. 
We can write the right-hand side of (6.8.1) as 


1 TA —a5™) (1 — 42) (1 — ab 48)} 
(T= xi) (1 — gmt} (1—2) (l—22) (1-2)... - 


and the numerator on the right can be transformed, by a standard formula 
from the theory of the theta-functions, into 


l—2?@-—284+29+gN—..,, 
where the indices are the numbers 
R(5n2 tn) (n= 0,1,2,...). 


* Parts differing by 1. 
* (6.10.1), with each side multiplied by (1—2) (1~2?).... 
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We have therefore to prove that 


x a l—g?—284294+aN_-... 
(101) ltt iy Q=2)0=e) G22) 0-7). 


Similarly (6.8.2) is equivalent to 
ae pee ee _ l-an~attattoB~... 
+q-ad-at * @-e 0-0-2...’ 


the indices in the numerator on the right being the numbers 3(5n? + 37). 


x 
(6.10.2) 1+5—— 


6.41. We use the auxiliary function 


(6.11.1) Gy = G,(a,%) = SX (— 1) arrginGatd—kn(] — gkgrhn) 7, 
n=0 


where & is 0, 1, or 2 and 


Gwe 6 _ (=a) (laze)... (1- ae") 


(1-2) (1-2)... (1-27) 
Thus 
(6.11.2) G, = (1—@*) C)— a8 *(1 — ake*) O, + atet-2k(] — akytk) CO, —.... 


If a+0 then G@, = 0 for all x Also 


(6.11.3) G,@,2) =1-z—at+a7+a—... 
and 
(6.11.4) G,(a,2) =1l—a®—2'+a294eU-.., 


are the series which occur in (6.10.2) and (6.10.1). 
If the operator 7 is defined by 


nf@, x) = f (az, x), 


then 
i (l-—ag)...(1~az")  1l-—aaz 
6.11. = = 
; 1) We ia sean ge 
an 


_ (l—ae)...(l-aet1)  l-2t 
(6.11.6) Cnr Ana) ina) Toa 


Hence 
(6.11.7) (1-#")C, = (l-a)90,4, (1—ax”) C,, = (1-4) Cn, 
and in particular 40, = C= 1. 


If & is 1 or 2, then 


ay 
Gy — Gag = SE (— 1p atryinGntt)—kag] — ghytn — yk] — ak-ly%k—1)n)} Ch, 
n= 


=a-ll—a)+ 5 (-1 JRatngdnisnt+l)lenf(] — gk) 4 gk-tg@k—1a(] — gan 3 C, 


n=l 


ug ak-l ]— a) 9Cy zie (Ql vz a) > ( oa 1)" qinginGntl—kn ing) 4 + ak-lz@k—-Iny C,}; 


n=l 
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by (6.11.7). When we rearrange this series in terms of 7C, 704, ..., the 
coefficient of 7C,, is 

( =] ye (1 ne a) {q2nrk—lgkaln-+l)+ (kn — g2tt2gh(in-+1) (Sn+6)—kin+1)) 


=(-1)"(1- a) arth leimont DHE—Dnf] _ 8K nti) 


Hence 
ao 
G, = Gay ~ er ~—a) safe F (- 1)*a@2mgtnlint)Hk—Dnf 7 as ASK f8—K) ant 9Cy. 
coe} 
But Gs_z a x. ( tly 1)” arg hn(Sn+1)—B—Hynf ee a8—ky@—han) Ch 
n= 


co) 
and so NGs_z =) ( aa 1)" Png iran DHE-Dnf] — a8kyG—h arty) Ch; 
n=0 


and therefore 
(6.11.8) Gy — Gaya = (l—a) aF¥ “7 G_, (& = 1,2). 


6.12. If now 


H, = H,{a, 2) = Gy (a, x) 


(1-4) (1 —az) (1~—ax?)... 
(so that H, = 0), then (6.11.8) becomes 
Hy, - Ay = a sy. 


In particular 


(6.12.1) H,=9H,, H,—H, = ay, 
and so 
(6.12.2) A, = nl, + ay?H,. 


Suppose now that A, =1+ca+e,a7+..., 


where the coefficients depend on x only. Substituting into (6.12.2), we 
obtain 


l+ea+eg+... = l+cyar+c.a%a® +... +a(1 +e,a2" + cga®et+...). 


Hence, equating coefficients, 


xe a4 
iS eg? C2 = Tyee os = 7 yp 
F gn) 
oe + a al)... dar) 
and hence 
G,(a, # a aa? 
Wray = alee) = 14 
eas 


+ 


d-a)0-#) 0a)" 
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Also 


G(a, a) eat sa es = ae 
Genie aen = he Re ie 
atx a3y9 


(I—2) (1—a4) * (T—2) (1-24) 0-23) 


+ 


Finally, putting a =a in these two formulae, and using (6.11.4) and 
(6.11.3), we obtain (6.10.1) and (6.10.2). 

This proof is elementary, and reasonably simple; but it is undeniably 
rather artificial. It is a “verification”; we verify that the series (6.11.1) 
satisfies a functional equation, and the argument gives no explanation of 
our choice of this particular series. 


6.13. There is another proof by Rogers which seems to assume a little 
more but is really more illuminating.’ 
We shorten our formulae by writing 


ihe Sal vid = 
. Lp = l—a, 2y! = 2yX,...2y, 


and begin by expanding the function 


f(a) = [](. +42") 


1 


in powers of a. The function satisfies 


f(a) = (+ ax) f (az). 


Substituting a power series in a for f(a), and equating coefficients, we find 
without difficulty that 
gina) 


x x8 
a) = 1+—a4+—-a? +... A es 
f(2) oe ee tut any 


x,! 


Replacing a by ae and ae”, and multiplying the resulting series, we 
find that 


(6.13.1) @(2,0,a) = Td +2az” cos 6+ a%r”) 
1 


Garey rte ; « ; 2 ‘ 
= (14 Face Sater. L4+ act gteti8 
x! Ly! x! Xe! 


B,(0) 


z,,! 


=14 pan ar, 
1 


* The argument of §§ 10-12, ifregarded as a proof of (6.10.1), assumes nothing, though 
some knowledge of theta-functions is required to identify (6.8.1) and (6.10.1). In 
the proof here we use formulae from the theory of theta-functions in the proof. 
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where 
6 ann) Lv. a 
(6.13.2) Brl¢ Le AE (. +2 2a cog 20 -— Beam 2x4 cos 40+... 
Voy! yt by! nad Cnsitnre 
ene nan Ft ont cog 2n8), 
Gata ara +++ Con 
6 gpltrrl? & 
(6.13.3) Bansal) Se (2 cos +—* 22? cos 30 
Loni! By! tna! nee 
a Soe 228 cos BO +... +-——BO Red __ ay nin-+l) ogg (Qn+ 1)6), 
Vy oenag Type nsg a= ons 


Finally, we replace @ in (6.13.1) by x-#, and use a standard formula from 
the theory of the theta-functions, viz. 


O(a, 0, a4) = TT(1 + 20-4 cos + 42-1) 


1+ 2x4 cos A+ 2x? cos 20+ 2x? cos 39 +... 
(l—a) (1—#?) (1-2)... 


il 


We thus obtain 


1+ 22% cos 6 + 2x3 cos 20 + 2a? cos 204+... 


pa onl?) 
(2) —2) (1-24). ee 


nr 
x,! 


(6.13.4) ta: 


6.14. If we replace B,(0), in (6.13.4), by its explicit expression (6.13.2) 
or (6.13.3), and rearrange the right-hand side as a trigonometrical series, 
we obtain an equality between two convergent trigonometrical series. Such 
an equality must be an identity, the coefficients of cos n@ in the two series 
being the same. It follows that we may replace 


1, 2cos@, 2cos 20, 2 cos 36, ... 


by any numbers for which the series remain convergent. 
If we replace 
1, 2cos 26, 2.cos 40,..., 2cos2n8, ... 


by 1, —(1+a), e(1+2%), ..., (—1)*aino-D(1 +27), 
and all the odd cosines by 0, then B,,,(6) becomes 


Lon! x. LyX 
(6.14.1) B = Fel emia — na be fae rt. 814 22) — 
me Ly! By | Xn tt ) Xnsitnee 
+ (= 1p etn Fa pancon— (1 4 ny}, 


Vieni Bi 
, NHELY NE 9+ Van 
and we obtain 


Be By 1-2-3 +e%4qH_,, 
6.14.2) 14 f2¢a4Pagey 8 a 
HEE) OE pera Pk cr ere gay eemay 
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where the right-hand side is the same as in (6.10.1). On the other hand, 


if we replace 
2cos 4, 200830, ..., 2cos(2n+1)4, ... 


by (l—a), -(1-), ..., (—1)*ain@—-2(] ~ 2241), 
and the even cosines by 0, then B,,,,(0) becomes 
(6.14.3) 
= Banga! pe—2 ——2 (1 —2°) 4 ntl 7] —2)— 
Panes By! Bn4s! Fng2 ( Tn+2%n+3 


4 (— 1) San Ft _ phntn 0] ~am)| . 
Lnze%neg s+ Contr 
Multiplying by a, we obtain 


4 vty fs og Ie 
(=a) (24) (1-28)? 


where the Lee side is the same as in (6.10.2). It remaims to prove 
that the series on the left in (6.14.2) and (6.14.4) are the same as in (6.10.1) 
and (6.10.2). 


(6.14.4) 


6.15. We can do this by evaluating £,, in an elementary manner. But 
before doing this I observe that the substitutions of § 6.14 correspond to 
linear analytical transformations. Thus if « = e-? then 


| ° e278 cos 8.2cos 2nO.dé -{° e2foos (2n ~1) G+ cos(2n+1) Hd 


= af (hr) {eben Ba e Kent 0"8} = (m8) a, getrn—-D(] +a"), 


Hence the first substitution of § 6.14 corresponds to the result of (i) replacing 
8 by 6+ 47 and (ii) operating with 


Je =) xf” e278 cos @ ... dO. 
m6 = 


Similarly it may be verified that the second is the result of (i) replacing 
0 by 6+ 47 and (ii) operating with 


-MG)af e208 sin 20... db. 
76, nis 


6.16. We now write 


el 
2) 
(6.16.1) fo, = panne, Bona = Pansat UE tt 
X,! Ly! Ly a mat! 


and prove that 


(6.16.2) Yon =iq!s Vener = %n4a' 
HR 7 
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We shall then have 


pa 1 = +1), os 
(6.16.38) Bon = 2M ss@n yo ony Bangs = CM nga Congas 


and it may be verified at once that the series on the left of (6.14.2) and 
(6.14.4) reduce to the Rogers-Ramanujan series. 
To prove (6.16.2) it is enough to prove that 


(6.16.4) Vonti = %ni1Van Yonte = Yanti: 
Now 
a, te Pe ee ee ae 
Von = 1l-—a(1+e panned 45 (] + ap?) — Bd ned 18 4 8) 4, 
Tata T4142 X41 % 42% +3 
x x Le (a ae bed 
= (1-2 2] oo? —2 (1-28 ws 1) +27 mene (1 go 2 ‘)-.. 
Cnat Tay Tn+ Xn 41% pie Tr+3, 
1 x Cotes 
= ——(l~a) —2?——2— (1-23) +2? 2 24 —_(1~—25)-.., 
Unt Tn41%n+e Tnylnpolnis 
__ Vent 
a ? 
Cnty 
and 
x boa open = 
Yans = (1-2) ——*-2°(1 — a8) +22) 9°71 — a5) 
nto Tn42% 4s 
a 
— ened yl] at) +... 
Cy 12%n43%ns4 
a, a, Cn Cn Con Ln 
= 1a 40s) gt Bo (14 q¢Znat) gue Eaten (14 20%!) 4. 
Tn+2 Trta Tn43 Tro neg Vn +4 
%, Carre, Cea hy@. 
= 1—-—12(1 oe) +R 9501 4 gy) nd nnd 912] 49738) 4... 
rte Tr4eln+3 Tns2Uni genta 
= Vente 


These are the relations required. 
The equations (6.16.2) are 


(6.16.5) 1-1 +2) + (1-2) (l-—a"—) Fe oe 


i- (ah (as) 


= (1-2) (1—2?)... (1-2) 
and 


(6.16.6) (12) — LoS (1-28) + 


(1-a") (1-27) 
(1 —am+?) (1— a3) 


= (l—2)(1—2?)... (l—a™*). 


a?(1—25) — 


Each of them reduces, when 7 ->00, to 


l—a—+a5+a7?—.., = (1—x)(1—2*) (1—23)..., 
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Euler’s identity; and the argument of this section gives a particularly 
simple proof of the identity. We shall be led to the formulae (6.16.5) and 
(6.16.6) in a different manner later.’ 


6.17. It follows from (6.12.2), or may be verified directly, that 


oe art . 
l—xz' (l—2)(1~2*) 


F(a) = A,(a,x) =1+ 
satisfies the functional equation 
F(a) = F(ax)+axF (ax?). 
From this it follows that 
F(a) 


F(az) ees F (az) 


-14 ax? ax 
~ "141414... 
In particular 
Fe age F(1) 


*TPIPT4+.0 ~ Fe) 
_ (1=2*) (1-27)... (1-24) (1-2)... 
~ (l=) (1-29) ... (L-24) (1-2)... 


_ l-v-284e%+a0—... 
~ l-a-at+a% te... 


is a quotient of elliptic theta-functions, which may be evaluated for certain 
special values of x. This formula is the key to Ramanujan’s evaluations of 
the continued fraction for special values of z, which I quoted in my first 
lecture. 


NOTES ON LECTURE VI 


This lecture contains a good deal of tho substance of Hardy and Wright, ch. 19, 
and thore is inevitably a certain amount of repetition; but the account here is 
naturally less systematic. There is nothing in Hardy and Wright corresponding to 
§§ 6.13-16. 

§ 6.2. See Hardy and Wright, § 19.11, or MacMahon, Combinatory analysis, ii, 
21~23. Franklin’s proof was first published in Comptes rendus, 92 (1881), 448-450. 

Both Hardy and Wright and MacMahon give other examples of ‘graphical’ 
proofs. 


§6.3. For a more rigorous proof that F(z) enumerates p(n), see Hardy and - 
Wright, § 19.3. 

§§ 6.4-5. Compare Hardy and Wright, § 19.12, where however there is no proof 
of (6.4.2). There are alternativo proofs of (6.4.1) and (6.4.2), and a proof of (6.4.3), 
in no. 30 of the Papers. Darling (3) gave further proofs of (6.4.1) and (6.4.2). 


* See Lecture VII, § 7.8. 


B7ETTM RE. 
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There are interesting remarks on the parity of p(n) in MacMahon’s paper 2. Mac. 
Mahon does not prove any general theorem, but gives recurrence congruences (mod 2) 
by which it is possible to calculate the parity of p(n), for quite large n, very quickly, 
Thus he proves, in ‘about five minutes work’, that p(1000) is odd. 

One of the standard proofs of (6.5.1) is reproduced in Hardy and Wright, § 19.9. 


§§ 6.6-7. Ramanujan’s proofs of the congruences for moduli 5%, 72, and 112 are 
contained in an unpublished manuscript now in the possession of Prof. Watson. 
Darling (2) gavo a proof of (6.7.1), and Mordell (1) much shorter proofs of both 
(6.7.1) and (6.7.2). 

The references relevant to the work of Chowla, Gupta, Kretmar, Lehmer, and 
Watson are §. Chowla (1); Gupta (1, 2, 3); Kreémar (1); D. H. Lohmer (1, 3); and 
Watson (24). 


§6.8. Rogers (1): the identities are formulae (1) and (2) of §5. Rogers also 
anticipated ‘Hélder’s inequality’ (and is quoted by Holder), but without writing 
it in the standard form or recognising its fundamental importance. See Hardy, 
Littlewood, and Pélya, Inequalities, 25 and 311. 

Roger’s two later proofs are in his papers 2 and 4: the latter contains the proof 
given in §§ 6.10~6.12, the former that given in §§ 6.13-6.16. 

For Ramanujan’s own proof see no. 26 of the Papers. Schur’s two proofs appeared 
in Berliner Sttzungsberichte (1917), 301-321, and Watson’s in Watson (3). 

Ramanujan does not scem to have stated the formulae explicitly in his letters to 
me, but formulae IX, (4)-(7), of his first letter depend upon them. Sco Lecture I, 
formulae (1.10}{1.12); Papers, xxvii; and Watson (4). Ramanujan proposed the 
formulae as a problem in Journal Indian Math. Soc. 6 (1914), 199; see Papers, 330. 


§ 6.9. See Hardy and Wright, § 19.138, and MacMahon, Combinatory analysis, ii, 
33-36. 

§§ 6.10-6:12. See Hardy and Wright, § 19.14. The theta-function formulae required 
at the beginning of the proof are proved in §§ 19.8-19.9 (Theorems 355 and 366). 

§ 6.13. The expansion of f(a) in powers of a goes back to Euler. The theta-function 


formula is proved in Hardy and Wright, § 19.8, or in any of the standard treatises 
on elliptic functions. 


§ 6.17. See Lecture I, formulae (1.10)-(1.12). Proofs were given by Watson (4). 


VII 


HYPERGEOMETRIC SERIES 


71. Ramanujan’s work on hypergeometric series is contained in two 
chapters of his notebook which I analysed and edited after his death. This 
analysis has since occasioned a small flood of papers, by Bailey, Watson, 
Whipple and others, in the publications of the London Mathematical 
Society. There is an excellent account of the results of all these researches 
in Bailey’s tract. 

The problem which dominates Ch. 10 of the notebook is that of the 
summation of the series 


(7.1.1) ( 


1, Xo, =) 1+ My Gy  %y(%, +1) Ho(%y + 1) wa(a%g + 1) 


Bue LA U2 cee Dae 


and the chapter contains practically everything known about this series 
before 1922. When £, = a the series reduces to an ordinary hypergeometric 
series, and its sum is given by Gausa’s formula 


(7.1.2) 
pl tr 14 2% Oy (061 + 1) op (% + 1) am LB) (Bi %1 — %) 
( Ay ) Be ya 1.2. f,(8, + 1) IB, — &) LB, — a)” 


It will be necessary to use hypergeometric series of higher order, and 
T must begin by explaining the standard notations. We write 
a™ = afatl)...(a+n—1), ay) = a(a—1)...(a—n4+1), 
(dey & 
i Oooy oa = . Ba AT AAD a 
Br Bors By not! BOBO... BY 
The series is generally convergent for all x if p<q, and for |x|<1 when 


p=qtl. When p>q+1itis divergent for all # unless one of the & is 0 or 
a negative integer. We shall usually omit the argument 1 when « = 1. Thus 


alee 2 =F ie Oy, =) — FP (Ga Xe 1). 
Buby PON Bab TN Bu Ba” 


‘ Here and elsewhere I ignore questions of convergence. The reader should be able 
to supply the conditions for the validity of any formulae which I quote or prove. 


and oly 
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7.2. The key formula of the chapter is 


m2 SU Y+Z+Ut 23+ 1M Ln) 
Dh yr (~1)"(s +2 w 
(7.2.1) 2! 1)"*(s 4-2n) ml @+ytetUt ay 2tiylerer ly 
= 8 Patst+ i lytetutst)) 
s+ Plet+yt2+utst leyzu Le+u+s+1) : 


where one of the five arguments 
(7.2.2) @, Y, %, U, ~B—Y—-Z-U—As—1 


is a positive integer. If this condition (which is essential) is satisfied, then 
the series terminates, and the formula is an algebraical identity. The whole 
chapter, indeed, is essentially a chapter in elementary formal algebra; we 
are concerned, at bottom, with identities between polynomials. From these 
we deduce identities between infinite series, but the passages to the limit 
which are necessary, though some of them have points of interest, do not 
present any serious difficulty to a competent analyst. 

Ramanujan seems to have found the formula about 1910 or 1911, but 
he had been anticipated by Dougall. The formula looks formidable, but 
Dougall’s proofis very simple. Iimagine that Ramanujan argued similarly,’ 
but there is nothing in the notebooks to show. 


If we observe that Gq) = (—- 1) -ay 


(144s) s+2n 
CS 


we can write (7.2.1) in the form 


and 


$,1+4s, —2,-y, —2, -—u,et+yt2t+ut2s+l1 
(7.2.8) Fs 31 
$8,e+8+lyt+st+lz+stlut+et+l, -x-y-z-u-s 
- 1 F(e+s+1)P(yt+2+ut+s+1) 
Lst+Yet+ytetutse+Dayeu Llztutestl) , 
If we write —v for z+y+z2+u+2s+1, so that 
(7.2.4) ety+et+ut+ev= —25—-1, 
then the formula becomes 
1 ~— _ —_ _ _ 
(7.2.5) Fe $,1+ 38, —2, —y, -2, —u, —v fi 
$8,c+s+lytstlztstlut+stl,vt+st+l 
= 1 PatstiTy+e+utst)) 
Ps+I)L@+ytetutst ayeu Tet+uts+l) . 


‘ By such verifications in particular cases as, when associated appropriately, yield 
a rigorous proof, 
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The left-hand side hore is obviously symmetrical in 2, y, 2, w, v. The right- 
hand side is symmetrical in the four arguments , y, z, u, and, since the five 
arguments are connected by the symmetrical relation (7.2.4), it is sym- 
metrical in the five. 

One of the arguments is to be a positive integer. H, say, 

z= mM, 

then the series terminates, and the right-hand side becomes 
(8+ 1) (2+ u+s+1)™(uty+s+ ™ (yte+st 1™ 
(YF St lL) 248+ DM (utst LI (ytetutst Lm 


(7.2.6) 


And if we multiply both sides by 
(y+st+ lL (yt zt utst+1)™, 


then the formula asserts the identity of two polynomials in y, each of 
degree 2m. 
We assume the truth of the formula for 
z= 0,1,2,...,m—1, 
and prove itfor x = m. It is sufficient, after our last remark, to prove that 
it is true for 2m-+ 1 different values of y. 
Now the formula is true for 
y = 0,1,2,...,m—1 
from the inductive hypothesis and the symmetry in 2 and y, and for 
v= 0, 1, 2,..,m—1, 

i.e. for Se 
y =—2s—z—-u—m—1, —2s—z—-u—m—2, 2.., —2s—z-—u—Im, 
from the inductive hypothesis and the symmetry in # and v. Hence it is 
true for 2m values of y, in gencral different, and it is sufficient to verify its 

truth for one more value. We choose the value 

y = —s—m, 
This value of y is a pole of the last term only of the series (7.2.5); and it is 
sufficient to prove that the residue of this term is equal to the residue of 
the product (7.2.6). It is easily verified that each residue is 
mi Stl@ (e+2+ut 1 (2—m+1™ (u—mt YM 


-1 
(-}) (m—1)! (S+2+ 1) (84+ 441M (2+u—m+4 DM’? 


and this completes the proof. 


* I give @ little more detail in $7.7, whero I prove a more general formula which 
includes (7.2.3) as a particular case. 
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7.3. There is a whole crowd of striking particular cases, If we suppose 
that x, y or z is a positive integer, divide by s, write t—w for s, and then 
make u—> oo, we obtain 


~%,—Y, —2 
Coe) Ue eae 
_ P+ ytete+ ly Petett+ YT @ty+ttl) 
Pettt+ 1) Py té+ 1) Petit ly Paty tett+ly 
which gives the value of (7.1.1) when 
Oy +n + aly = By + Py] 
and one a is a negative integer. This result was found by Saalschtitz in 1890. 


Tf on the other hand we suppose that w is a positive integer, divide by s, 
putz = — $s, and then make u->00, we obtain 


(7.3.2) 

( —%, —-Y¥,8 ) PGst+iI@+st+ULlyt+st+ UF @tyt eth) 

e+s+lytstl)” Fist) le+dst lL Fyt+hs4+ D0 etyt+stly 
This formula, due to A. C. Dixon, gives the value of (7.1.1) when 

Oy + Py = y+ fy = Ggtl. 
Tt includes F. Morley’s formula 
1+ (t)'s (“es »y =; in C ia 5 008 den 

for the sum of the cubes of the coefficients in the binomial series for (1 —x)-. 


The formulae (7.3.1) and (7.3.2) are two of the three most important in 
the theory of the series (7.1.1). The third is 


(7.8.3) L(aty+s+1) Pe) 
“" Pet+st1)Pyt+st1) e+s+lytstl 
___ F(@+b+st1) -%, ees, 
~ L@+st)lb+s4+1) @+stl,b+s+1 ] 
This formula of Thomae’s was also rediscovered by Ramanujan. It is not 
a consequence of the Dougall-Ramanujan identity, which is my main 
subject here, but I give Ramanujan’s proof.’ By Gauss’s formula 
Le+y+s+nt+1) a 1 —2, =) 
Petst+n+lyPytstn+1) stn+1) \s+n41 
- 1 oe I(-a+m)IP(-y+m) 
LP(-a)[(-y)rzo mi P(stm+ntl) 


* This is one of the few cases in which Ramanujan gives an explicit proof in the 
notebook. The proof is essentially the same as Thomae’s. 
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Hence 
T(e+yts4+1) (ss —b,x+y+st+ 4 
P(e+st+l)Pyt+st+)) at+s+lyt+s+1 
1 o [(-a+n)I(—b+n)P(a@t+ytstntl) 
~ T(-a)l(-b) to) ni T(e@tstntllyt+stnt)) 


1 
= Ta) T(—b) F(—a)I(-9) 
2 [(-a+m)I(-y+m)I(-a+n)I(—b+n) 
we mint P(s+m+n+4+1) 


and the conclusion follows by symmetry. 


7.4. Treturn to the Dougall-Ramanujan identity. The notebook contains 
a mass of elegant summations, most of which, though not all, can be derived 
from (7.2.1).t I cannot quote many here, but I will say something about 
the formulae (1.2), (1.3) and (1.4) of my first lecture. 

If we suppose w, in (7.2.1), a positive integer, and make u—oo, we 
obtain 


= ” i 3) Tn) 
(7.4.1) ah TD) (s+ 2n) Fa sliess+ He 


_sle+y+e+s+)) I(a@+s+1) 


I(s+l) opel (ytets+1) 
The special cases e=y=-8, 220, 
and eaysz=—s 
give 
3\8 s(s + 1)\8 sin sa 
7.4. - =} 4 1 
(7.4.2) 8 (s+2)() (o+4)/ = ail 


which reduces to (1.2) for s = 4, and 


sey ; sin? sn {I(s)}? 
1.2 ) °°" ~ 27% cossm I'(2s) ’ 


(7.4.3) +(0+2)(5) +(6+4)/ 


which reduces to (1.3) for s = }. 

The formula (1.4) is more difficult, and it is not possible to deduce it 
from (7.2.1). Proofs have been given by myself and by Whipple, the first 
depending on the theory of the Legendre polynomials, and the second on 


* There are a large number of examples in my paper 8, and on p. 96 of Bailey’s 
tract, 
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a generalisation of (7.2.1) due to Whipple himself. Hach proof shows that 
Ramanujan’s series is equal to 


20+() +(34) +--| 


and then sums this series by means of (7.3.2). It would be very interesting 
to know Ramanujan’s proof. 
Another interesting summation is 


1\8 (1.38 4 Tee 
TOME (Gh ence 
Gee (3) * (5 neared 
This also is not a consequence of (7.2.1), but may be effected by moans of 
the two formnlac 


f oP MPL pf mathe 
CC CP se ane CRS 


and ae 
j a( GB, _ PU +a py P+ 3%) 
Gat PAO aer -1) ~ PU -bay P+ da By 


due to Clansen and Kummer respectively. Both of these formulae are in 
the notebook. If we put 
anwf=t 


in (7.4.5) and (7.4.6), andr == — 1 in (7.4.8), we obtain 


15, (1.8) na 
Fay =| 


= {2i(¥-)f = (gna) 


7.5. Another curious formula which has attracted a good deal of 
attention is 


_f Em) im (3) 1.3\2 : 
“ther 1+ 3 1) +... to 2 terms), 


J prove this as a special caso of a more general formula found by Bailoy, viz. 


cay REMC) ()har~ tome 


2 324) m+2° 


_ {E(mt4)Ffl | fi\?_ 1 1.3\? 1 
=| Ga | (:+(5) cit 4) weet to m tori. 
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We can write this as an equation between two terminating series of the 
type ,. For 


pieme | DV ues 1.3\7 1 
—+{-) ——+{>—]} ——3+... to” terms 


m \2) m+1° \2.4) m+2 
__ f1.3...(2%—-3)\? 1 [1+ (2n — 2)? (m+n—1) 
~ (2.4... (Qn —2)f mtn—-1 (2n — 8)? (m+n —2) 


? 
? 
Pima — 4) (m+n—1)(m+n—2) a 
(2% — 3)? (2n — 5)? (m+1—2) (m+n—83) 
1 P(n-4)" pf), —2+1, —n+1, ~m—n+l\ , 
= maETaF 2 ( —n+$, —n+3, -—m—n4+2 ) 
Hence.(7.5.2) reduces to 


1,-n+1, —n4+1, —-m—n+1 
1.8. P(” 
ee) eR ne, | 


1,-m+1,-—m+1,-—m—-n+1 
ioe = 2F > , > 
(m— 4) (ee eens 
Now a :2) = (1-2). aera 2] 
and so 
—n+1,—n+l1 —m+4,-—m+t 
ma —M—N+2 ’ ) A —m—n+l ’ 
2 —m+1,—m+1, —n+h, —n+h. 
= h( ee i) aF( Ante eo 


If we equate the coefficients of 2”*"-1 on the two sides of this equation, it 
will be found that we obtain (7.5.3). 


7.6. Most of the preceding formulae, and in particular (7.2.1), can be 
generalised for ‘‘basic” series. I prove the generalisation of (7.2.1), since 
it is interesting in itself and has a curious connection with the Rogers- 
Ramanujan identities. 

The “‘basic”’ generalisation of the hypergeometric series was first studied 
systematically by Heine. Suppose that, in the hypergeometric series 


ab  a(a+1) A 6+) 
7.6.1 1+ 
a bale vi 1.2.7 +1) 
. 1-9 
we write Toga? 
where 0<q<1, for Ace 
Ban 


' Using the notation of $7.1. 
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(its limit when g>1), and then write 1, m for g* and qg*; and that we also 
introduce a factor g” in the n-th term of the series. We thus obtain the series 
(l-a@) (1 aay (L—a) (1—ag) (1-5) (1-69) 
(1—g)(L~e)* © (1—) (1—9*) (1—-e) (1-eg) 


which reduces to (7.6.1) when g>1. This is the ‘‘basic series” corresponding 
to (7.6.1). 
More generally, we may write 


(a)? = (1—a@) (1~—ag)... (l—ag") 
and 
Gy, Oy, +5 & (Oa)G (Ga)y --- (Gre 
7.6.3 a vo ‘ =r nd” 
(7.8.8) Dy Byy nse) > QE Gap -~ Cale 2 
Ifa, = g%,...,5, = gf,..., and g>1, then (7.6.3) reduces to 
bys Olgy v 2+ OF 
*( ‘13 “2s ’ ) 
Bu Bas 889. Bs 


An additive relation between the a and f, such as a,+/, = 1, will corre- 
spond to a multiplicative relation between the a and b, such as a,b, = q. 
The analogue of the series (7.2.5) is 


(1.6.4) af” 19a, —gaa, 1/b, Le, 1/d, Le, se 
= Ja, —a/a, abg, aeg, adg, weg, af 
Here we have replaced 8,%,Y, 2, U,V 


(7.6.2) 1+ 


Qt... 


by G=4,PR=bq=cFfrdq=e@=f. 
The effect of the four parameters 
qa, — qa, af, — aa 


— ag? 


is to contribute a factor fea 


in the general term of the series, just as the effect of the two parameters 
1+ 4s and 4s in (7.2.5) was to contribute a factor (s+2n)/s. The relation 
(7.2.4) is replaced by 


(7.6.5) a*bedefg = 1, 


and one of the parameters, say 5, must be of the form g™. 
We can transform the product of gamma-functions on the right of (7.2.1), 
and the finite product (7.2.6), similarly. We have 


Py t+ UY Pet+l) _ & Art 2) Ast) 
PA,+YLAg+1) 7 “r (41 +2) (4g +2) 
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when A, +A, = #1 +H, and this is replaced by 


FQ) FQ) 
f(y) f (i2)’ 
where fO= TI —1g"). 
Hence the product of gamma-functions becomes 
f(abe) 
(7.6.6) J (@)f(abede) LL, Fab) flabed)° 


and the finite product becomes 

(aq)q (aqde)? (agec)y (aged)a 

(age)y (agd)y (age)y (agede)y 

and we are led to conjecture that, if (7.6.5) is satisfied, and one of 8, c,d, e, f, 
say 6, is g, then (7.6.4) is equal to either of (7.6.6) or (7.6.7). It is plain 
that (7.6.4) is symmetrical in 5, ¢, d, e, f (since the restriction can be imposed 
upon any one of these parameters), while (7.6.6) is symmetrical in four of 
five symmetrically connected paramcters, and so symmetrical in all of them. 


(7.6.7) 


7.7. We can prove the identity, which was found first by F. H. Jackson, 
by an argument which runs strictly parallel to that of §7.2. We assume 
it true for =1q@,.5¢7 
and prove it for b = g™, Ifb = q™, and we multiply both sides by 

(age)q (agede)?’, 
then it becomes an identity between two polynomials in ¢ of degree 2m, 
and it is enough to verify it for 2m+41 different values of c. Now it is 
true for c= 1,9;9%,...,g%7, 
by the inductive hypothesis and the symmetry in 6 and c, and also for the 
m values of ¢ corresponding to the same values of f. It is therefore enough 
to verify it for one more value of c. 

We choose the value c=alge™, 
which is a pole of the last term only of the series (7.6.4). It is sufficient to 


prove that the residue at this pole is equal to the residue of (7.6.7). The 
residues are, apart from a factor —a-1q—”, the values of the coefficients of 


1 es 1 1 
1—acg” ag™e—a-lq-™ 
when c = a—1y-™; and we have to prove that these coefficients are equal 
for this particular value of c. 
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To calculate the first coefficient we observe that 


= adeq, 


and we obtain 


l-ag (at (ar (ag™)P (1 /d)F (1 /e)s (adeg) om, 
l—a@ (g)P (ag! (g-™1)R-! (adg)™ (aeq)” (1/de)™ 


(7.7.1) 


Since 


(a x (0 % i) ( -4) ae (2 -) = (= 1)rgtnn—2) dm (dg—m+tym 


the terms in 1/e and 1/de may be transformed similarly, and 
(gg = (1-9) (1-4?) ... (1-9) = (— Lym giein (gem), 


l-ag’™ (ale (ag™)r _— (adeqyp (dg ™**)F (eq) 
Lae  (agryR (qm yp (adg)p (aeq)p (deg) 


(7.7.1) is 
Also 
le Cri gc 
l-a (agp 1 -a 
Hence finally (7.7.1) reduces to 


(agqye (adeq)y (dq—™+t)t (eq VR 
(YP (adg)y (aeq)p (deq7m tt)’ 
which is also the coefficient occurring in (7.6.7). 


This completes the proof of Jackson’s identity, which includes the 
Dougall-Ramanujan identity as a limiting case. 


1a) (1—ag)... .—agh-) S—EE  (aa)y. 


7.8. Let us suppose, in particular, that 
a=1, b=q", caod=e=e, fags, 
and make ¢ tend to zero. Then 
(1—e7}) (1—e74g) ... (l—e7lg?-1) 
(1 ~—aeq) (1 —aeg?)... (1 —acg”) 
and similarly for the factors in d and e; and 
C= Paao Ue) 
(1—afg) (1 — afg*) ... (1—afq”) 
After a little reduction, we obtain 


~(=1)rgin Renn, 


~—] ye gq nbn en, 


me sm (gm) (gt). (gently 
ZC Ea eee 


= (I~g) (1-9?) ... (1—@™), 
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which is the first of the two formulae of Rogers proved directly in § 6.16. 
If we take @ = q instead of a = 1, we obtain the second formula. These 
fornrulae lead, as we saw in § 6.16, to a proof of the Rogers-Ramanujan 
identities. The proof thus obtained is a compromise between Rogers’s proof 
given in §§ 6.13-16 and the proof found later by Watson. Watson uses an 
identity which is more complex in form than that of §§ 7.6—7, but which leads 
to the Rogers-Ramanujan identities by a direct limiting process. We use 
the simpler identity, the direct. generalisation of the Dougall-Ramanujan 
identity, to avoid the rather tricky algebra of § 6.16, but retain the more 
straightforward part of Rogers’s argument. 


NOTES ON LECTURE VII 


§7.1. My analysis of the two chapters is in Hardy (8). There I call the chapter 
with which I am primarily concerned ch. xu, but it is ch. x in Watson’s copy of the 
‘second edition’. See Watson, 10, 139. 

Almost all of the formulae discussed in the lecture are proved in Bailey’s tract 
Generalised hypergeometric series (Cambridge 1935). This tract contains a full biblio- 
graphy, and I do not repeat references to papers published before 1914. 

The notation for generalised hypergeomotric series which has become standardized 
was introduced by Barnes, Proc. London Math. Soc. (2), 5 (1907), 59-116. 

§ 7.2. The Dougall-Ramanujan identity is the first formula in ch. x of the note- 
book. Formulae (1.2) and (1.3) of Lecture I are special cases, but Ramanujan does not 
scem ever to have printed the general formula, which I found in the note-book only 
after his death. 

§7.3. The special case of Morley’s formula in which m is a nogative integer had 
been found by Dixon as early as 1891 (and proved more shortly by Richmond in 
1892). Morley published his formula in 1902, and it was his result which led Dixon, 
in the same year, to (7.3.2). Dixon’s original proof was very complicated. 

The corresponding sum for the expansion of (1+ 2)" is a 3, with argumont —1, 
not summahble as a finite product of gamma-functions. 

Formula (7.3.3) is equivalent to (1), §3.2, of Bailey’s tract. It is an expression of 
the theorem that 


I e Oa» *) 
E(B) LBs) P( Bat Ba ~ 0 ~ Ng ~ Mp) bubs 
is a symmetric function of the five arguments 
Bis Ba Bit Bem tym hy, Byt Bar ts—%, Py +fy—ay— ay. 


§7.4, The limit process used in the deduction of (7.4.1) requires a little attention. 
See p. 27 of Bailey’s tract, and the paper of Dougall’s to which he refers. 

For (1.4) see Hardy (4) and Whipple (1). 

Whipple gives two generalisations of (7.4.4) in 3. These generalisations give the 


sums of the series 
plbited-e 4 
wl l4e, lox’ 


Qn 
og Bi dta, b4+20 
Pf L+2, 1422 ’ Us 
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which reduce to Ramanujan’s series when « = 0. The second can be proved by the 
method stated in the lecture. 


§7.5. The first proofs of (7.5.1) were given by Darling (1) and Watson (5); and 
generalisations by Bailey (2, 4), Hodgkinson (1) and Whipple (2). The formula has 
an interesting application to the ‘Lebesgue’ and ‘Landau’ constants: see Watson, 
Quarterly Journal of Math. (Oxford), 1 (1930), 310-318. 

Bailey (4) proved (7.5.2), and the still more general formula 


T(a+m) Py +m) F zy, v+-+m—1 | a Fea RP 2,Yy,v+n—-1 
Tim\Tetytm) > oetytn J], Pln)Tleryen) > A oety4n || 
Here [...], and [...],, mean that only the first n or m terms of the series are retained. 

This reduces to (7.5.2) whonz =y=4,v= 1. 
Formula, (7.5.3) is a special case of formula (1), § 7.2, of Bailey’s tract. Replace 
X,Y, 2 U, V, Ww, nN by 
—-nt+1, -—n+1, 1, —m—n-2, ~—n+ 3, —n+3, mtn-—1. 
Further information will bo found in §10.4 of the tract. 
§7.6. See Heine, Theorie der Kugelfunktionen, i (1878), 97-125. 


$7.7. Jackson, Messenger of Math. 50 (1921), 101-112. See §8.3 of Bailey’s tract. 
Watson’s proof of the Rogers-Ramanujan identities is given in §§ 8.5-6. 
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ASYMPTOTIC THEORY OF PARTITIONS 


8.1. In this lecture I shall be concerned with the question: how large is 
p(n) for large n? It is very remarkable, if we consider how much has been 
written about approximate or asymptotic values of arithmetical functions, 
that this question should never have been asked before 1917, when 
Ramanujan and I published the memoir which is now no. 36 in his Papers. 
We were very lucky in finding a problem which proved to have so much 
individuality and to admit so complete and so surprising a solution. 


8.2. There is one obvious method of attack on any such problem (if no 
strictly elementary method suggests itself). If a, is positive, and the power 
series P(c) = Sa, 


has radius of convergence 1, then there is a general correspondence between 
the order of magnitude of a@,, for large n, and that of F(x) for # near 1. 
The first step, therefore, is to determine the order of magnitude of Euler’s 
function 


1 
(8.2.1) Fle) = (—a) (1-2) (1-2)... 


when 0<%<Jl and x1." This is quite simple, if we are content with a 
rough approximation. For 


log F log — ai = 
oe) me °8 [an == m = Dama 
and ma"-1(1—2)< 1-2” <m(1—2), 
so that 
] xm 1 x 
8.2.2 ee paescerer) Joe 
(8.2.2) Tog pe 08 Fle) <p ae 


Each of the series in (8.2.2) has the limit 47? when x1, and so 


(8.2.3) log F(c) cares 
or 
(8.2.4) Fe) = exp (2 


* I write 2 now instead of g, g being wanted for other purposes. 
HR 8 
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It follows that the order of magnitude of F(z) is, to a first approximation 
that of 


(8.2.5) exp 


2 


2 


pein 
6(1—2)" 
We want to know the order of a,, corresponding to this order for 
F(x) = La,,2". 
Ifa, = 2%, where «> —1, and a = e~¥, so that y+0, then 
F(a) = Lnra” = Lnrery ~ | "re-vat = ao eel 


On the other hand, if a, were as large as e'”, for some positive 6, then the 
series would diverge before # reaches 1. It is plain then that a, must be 
smaller than this, but larger than any power of x. It is natural to conjecture 
that the right order is about eBnd 


for some 6 between 0 and 1 and some B. 
The order of G(x) = LeBregn — LeBnb—ny 


may be calculated roughly from that of its maximum term. This occurs 
when Bin’ = y, approximately; and the maximum term is then about 


b 
exp{C(1—2x) 1-4, 
le AB 


where C = B->b1-4(1 —8), 


This agrees with (8.2.5) if 6 = 4 and B? = 27%; and we conclude that the 
order of (nz) should be about 


eknt 
where 
(8.2.6) K =nJ(2). 
8.3. It is actually true that 
(8.3.1) log p(n) ~ Kn? 
or 
(8.3.2) pln) = elK+on)ink 
but we cannot prove this very simply. It is however quite easy to prove that 
(8.3.3) ednk < (n) < eBnt 


for n>0 and some positive A and B. 
If x = e-¥, then y>0 when z+], and 


(8.3.4) l-any. 
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We write 
(8.3.5) F(x) = Xp(nje-™ = Gy); 


and we suppose that 
(8.3.6) 0<E<C air’ <D. 


(i) It follows from (8.2.3), (8.3.4) and (8.3.6) that 
p(nje-*Y < Gly) < expy 
for all n and small y. Hence, taking y = n-?, we obtain 
p(n) <exp (vy + 2) = But, 


with B = D+1, so that B may be any number greater than C+ 1. 
(ii) On the other hand 


E 
Qty) > — 
{y) are 


for small y; and so 
m oy E 
Gly) + Galy) = Dome + Ze plwjem" > exp 7 


meL 
for every m. But p(n) increases with n, and therefore 


(yn-+ 1) p(m)> y(y) > exp = — Gy). 


Also Go(y) = X p(nje™ < Y exp (Bnt—ny), + 
ml m1 
after what we proved under (i); and so 
1 Thin) 
Se agen cn 
p{m) spre 7 |e=P id eeeP (Bn na). 
We choose H so that H>2B, and take y = Hm-*, Then 
x exp (Bnt—ny) s 3 exp (But —2Bnm-?) < z erBnt < L, 
m+1 m+1 0 


say. It follows that 
1 


E 1 E 
a = =m + 
p(m)> (exp 7 r) rarer (esppm x) >exp Am’ 
for large m, A being any number less than H/H. Since H may be any 
number less than C, and H any number greater than 2B = 2(1+D), ie. any 
number greater than 2(1+C), A may be any number less than $0/(1+C). 
8-2 
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8.4. To prove (8.3.1) or (8.3.2) we require a general theorem of the 
so-called “Tauberian”’ kind. It is in fact true, whenever a, 2 0 for all n, that 


Cc 
log Xa,,x” ~ ie 


implies log A, = log (@y4+a,+...+@,)~2(Cn)3. 

When, as here, a, increases with m, we can replace the last equation by 
log a, ~ 2(Cn)}, 

and this gives (8.3.1). 

We have thus an asymptotic formula for the logarithm of p(n). But an 
asymptotic formula for the logarithm of an arithmetical function is a very 
crude result; it does not distinguish, for example, between the orders of 
magnitude ntl000gknt 


We should like, at the least, an asymptotic formula for p() itself. Actually 


1 
7 Kn. 
(8.4.1) p(n) in [3° ; 


but we cannot hope to prove so much as this by arguments of so elementary 
and general a kind. It is essential to use more powerful methods which 
pay proper regard to the more intimate peculiarities of F(x). 


8.5. Our natural resource is Cauchy’s theorem. This tells us that 


(8.5.1) pln) = = ae ’ 


where C is a contour round the origin. We must move C into the most 
advantageous position and study the integral directly. There is of course 
nothing in the least novel in this idea, which is that which dominates the 
whole analytic theory of numbers, and in particular the theory of primes; 
but the setting here is quite different, and it is instructive to compare the 
two problems. 

In the theory of primes our generating functions were Dirichlet’s series 
Za,n-*, and the proof of the Prime Number Theorem depended upon the 


integral ee =e o (6) 5, 
Qi} eto (8s) 8” 
where c> 1. We moved the contour of integration to the left, across the 


pole at s = 1, and inferred that y*(a) and y(x) differed from the residue x 
at the pole by an error of smaller order than x. 


* See Lecture I, 
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The conclusion was correct, but the argument was difficult to justify 
because {(s) behaves in a very complicated way at infinity. In particular 
the location of its zeros is still highly mysterious. On the other hand there 
isno difficulty at all about the singularity which yields the dominant term, 
a pole at ¢ = 1 of the simplest possible character. 


8.6. The singularities of the F(z) of our present problem are very much 
more complicated. They cover the unit circle |x| = 1. The circle is a 
“barrier” for the function, which does not exist outside it, and there can 
be no question of “moving C across the singularities”. All that we can 
hope to do is to move C close to the singularities and study each part of it 
in detail. 

For all this, however, there are strong consolations. The function F(x) 
is one of a well-known class, the elliptic modular functions, whose properties 
have been studied intensively and are very exactly known. These functions 
all have the same peculiarities as (x), and exist only inside the circle; but 
they satisfy remarkable functional equations which enable us to determine 
their behaviour, near any point of the circle, very precisely. In particular, 
F(z) satisfies the equation 


8.6.1 ree ie), 7s Fea! 
(8.6.1) (2) = Sri 85) °xP ear oe 
where 
] 1 _ 2 ae eee 
(8.6.2) log log = 4m, ~ exp| log (1/z)!" 


If, for example, x is positive and near to 1, then a’ is extravagantly small 
and F(a") is practically 1; so that (8.6.1) expresses F(x), effectively, in 
terms of elementary functions. There are similar formulae associated with 
other points of the circle, such as 
— Left, e-antj, 4, etmt, 

(generally, with all primitive roots of unity); but (8.6.1) alone is enough to 
enable us to make great progress. 

Jn particular, if we take C to be a circle with just the right radius, a little 
less than 1, we can substitute from (8.6.1) into (8.5.1), and replace F(z’) 
by 1, with an error which turns out to be of order 


Ent 
enn. 


where H<K =7J(2). 


There are then only elementary functions in the integral, and we can 
calculate it very precisely. 
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The result is the formula 


d [eK&n 
(8.6.3) pO) = 55-75 a) toe 
where 
(8.6.4) A,=HV(n-&), H<k. 


This includes (8.4.1) and is very much more precise. The form of the 
dominant term is at first sight rather mysterious, but it arises naturally 
from the analysis. In particular, the “—j,” in A, arises naturally from 
the index 4 in (8.6.1). 


8.7. This however is by no means the end of the matter. The formula 
(8.6.1) is that appropriate for the study of F(x) near x = 1. There are, as 
T remarked, similar formulae associated with other ‘rational points” 

(8.7.1) pq = ePrila 
on the unit circle. One may say (naturally very roughly) that these “rational 
singularities” are the heaviest singularities of F(x), that P(x) is bigger near 
them than near other points of tho circle, and that their contributions to 
the integral (8.5.1) may be expected to outweigh those of other points. 

Further, these rational singularities diminish in weight as q increases. 
When x->1 along a radius, #'(x) behaves roughly like 


erlea= a 


while when «->2, , it behaves roughly like 


rare 
exp(—;———}. 
P Yoga =| zn} 
It is reasonable to expect that 
(8.7.2) p(n) = P,(n)+P,(n) +... +Po(n) + Rm), 
where P,(x) is the dominant term in (8.6.3), P,(m), Pa(n), .... Po(m) are similar 
in form, but with smaller numbers K,, Ky,...,Kg in the place of K, and 
R(n) is an error of lower order than Kon, 


The proof of all this can be put through without much additional 
difficulty. The form of P,(n) is 


(8.7.3) Ein) = D,(n) b(n), 
where 
Enh 
(8.7.4) ggln) = hs io) ; 


(8.7.5) D,(n) = Yi Wy, ge? 72, 
Pp 
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p runs through the integers less than and prime to g,' and w,,, is a certain 
24th root of unity. Thus . 
Lwj=1, difn)=FAm), = z 


And 
(8.7.6) Rin) = O(eHen), 
where Hy<Kg 


(so that Hp>0 when Q->0o). We can thus find p(n) with error O(e™*) and 
an arbitrarily small positive é. 


8.8. At this point we might have stopped had it not been for Major 
MacMahon’s love of calculation. MacMahon was a practised and en- 
thusiastic computer, and made usa table of p(n) up ton = 200. In particular 
he found that 

(8.8.1) p(200) = 3972999029388, 


and we naturally took this value as a test for our asymptotic formula. We 
expected a good result, with an error of perhaps one or two figures, but we 
had never dared to hope for such a result as we found. Actually 8 terms of 
our formulae gave p(200) with an error of 0-004. We were inevitably led 
to ask whether the formula could not be used to calculate p(n) exactly for 
any large n. 

It is plain that, if this is possible, it will be necessary to use a “large” 
number of terms of the series, that is to say to make @ a function of ». Our 
final result was as follows. There are constants a, M such that 


(8.8.2) pia= L Fal) + BO), 
where ne 
(8.8.3) | R(n) |< Mn-, 


and, since p(n) is an integer, (8.8.2) will give its value exactly for sufficiently 
large x. The formula is one of the rare formulae which are both asymptotic 
and exact; it tells us all we want to know about the order and approximate 
form of p(n), and it appears also to be adapted for exact calculation. It 
was in fact from this formula, that D. H. Lehmer first calculated the value 
of (721). 


8.9. It was however necessary, until very recently, to make a curious 
reservation at this point. The values of (200) and (243) were known, 
because they had been calculated directly by MacMahon and Gupta, but 
calculations based upon (8.8.2) were not decisive. We cannot use the formula 


* When g=1,p= 0. 
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to prove that (721) has a particular value until we have found numerical 
values for a and M; Ramanujan and I had merely proved their existence. 
It was necessary to go over all our analysis, give numerical values to all 
our “constants”, and replace all our “O” terms by terms with numerical 
bounds. In the meantime Lehmer’s calculations, which used 21 terms of 
the series, and led to the value 


1610617557502794776355347 62-0041, 


gave a very strong presumption about the value of p(721) but were not 
conclusive. 

The gap has now been filled by Rademacher, who (trying at first merely 
to simplify our analysis) was led to make a very fortunate formal change. 
Ramanujan and I worked, not exactly with the function 


ld (em 
$(m) = 58 ala): 
but with the “nearly equivalent” function 
ld one t) 
m2 dn An 
(afterwards discarding the less important parts of the function). Rade- 
macher works with 
b(n) _ 1 a amet) 
~ tAl2 dn Aan) 
which is also “nearly equivalent’’; and this apparently slight change has 
a very important effect, since it leads to an identity for p(n). 


We have 
@/l_., KA) d (Kk , K(n—3) eMail 
sny,sith q Iealaé 6g tes o(;5) 
for fixed n and large g. Hence the function 
+ d {sinh (KA,/¢) 
8.9. =-1 2 
(8.9.1 ven) = Le 5 eal 
behaves for large q like a multiple of g3.g-3 = q-!; and 
(8.9.2) | Zelm) | = |Zepgeerria| <a, 
so that . . 
(8.9.3) LL y(n) y(n) 
q 


is convergent. The series XL,(n)¢,(n) is not convergent; this question 
Ramanujan and I left doubtful, but it has been settled since by Lehmer. 
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Rademacher proved that 


oO 
(8.9.4) p(n) = 5 Le(n) elm) 
and that the remainder after Q terms is less than 
Q\t . , Knt 
+ ma pile 
CQ +D(2) sinh Q” 


where C and D are constants for which he found definite values. The 
remainder is of order »-? when Q is of order n}, as in our older work. 


Proof of Rademacher’s identity 
8.10. The rest of this lecture will be devoted to the proof of (8.9.4). 
The Farey series $y of order N is the set of irreducible fractions p/g, 
between 0 and 1, whose denominators do not exceed V. We include 0 and 1 
in the forms 2 and +: thus 3, is 


O21 1 2 323 4 1 
> &> hs Bo 6? 3, Bo 32 42 B 1° 


Tf “= pennt8 
and @ runs through the values of $y (so that the first and last values of x 
are each 7), we define a set of ‘‘Farey points”’ re??#2 on the circle |x| =r. 
Suppose that g>1 and 5 F 
pp g np" op 


7) 2 


en aes 
are three consecutive fractions of $y. We associate with p/q¢ the interval 
Eng OF 


Pp ” p ’ 
q \P a q | Xe 
1 1 
bh i = 3 ff ar eT ET se 
bar Xpa = gq") Xo gq+q') 


These intervals just fill up the interval (0,1), and the length of each of the 
parts into which £,, is divided by p/q lies between 


I 1 
2Nq’ Ng 
The definitions naturally require modification when g= 1. Then p/q is 
¢ or + and the interval has one part only. 

We shall also use &,, for the arc of |x| = 7 defined by the same values 
of @; the two extreme arcs, which now abut at a = 7, are to be amalgamated 
into one. We call this dissection of the circle the Farey dissection of order N. 

We apply (8.5.1) to the circle C' defined by 


fx] =r = en erlN!, 
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We write 


Gitimpiee ems ee 


= di = 35 
, aL , REL DQ? 
2m) x palm) g,% ‘De 


and study each integral separately. The range of summation is defined by 
(8.10.2) O<p<q, (pg)=1, 1lSqsN 


(except that, when g = 1, p assumes the single value 0). 


8.11. Ong, we write 
(8.11.1) we retro, = e-BnIN?, g=F+9 
so that 
Qpont | 2pm Qnez 
(8.11.2) 2= exp| : ~ 2a ig)|= exp("E™ =), 
where 


(8.11.3) «= a(ya-'#). 
Thus ¢ = 0 gives the Farey point of £,,,, 

(8.11.4) ~X = —Xpg SBS Xpg = 2X’ 
and 

(8.11.5) sao <aap |sl<zy- 


We also collect here some simple inequalities which will be useful later. 
Since |q¢|<1/N and ¢<N, we have 


(8.11.6) |2| =9(N-4+ g?)8 s (G?@N-44 N-*)t 5 DIN} 
(so that | z| is small, uniformly, for large WV); also 


id 7 
a exp(-7)| = exp(-739) <) 
an 
lal N- N- 
8.11.8 —h- Zt. 
vee qa PWS) PNA ENF 


8.12. We now use the formula of the type (8.6.1) but associated with 
eprtld instead of with 1. This formula is 


= 4 ST TTR: ! 
(8.12.1) F(x) Wpq@ EXP ( 124 F(x’), 
where 


(8.12.2) x= exp (2-2), x! = exp (27-2), 
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nq 18 the 24th root of unity referred to in §8.7, 2? has its principal 
value, and pp, =— 1 (mod q). The formula reduces to (8.6.1) when g = 1. 
Tf we substitute from (8.12.1) into the integral j,,, of (8.10.1), write 


ria z 
(8.12.3) Vz) = W,(z) = ztexp (eet 


and observe that 


Qnm  Qnpre_ one ai) 
qg a 


we obtain 


x 
(8.12.4) e2mniV4 | = wy ge-nnila | P(2) Fla’) eBanet dg. 
—x" 


Tt is convenient to have also a formula for j,,. in which the variable of 
integration is z. This formula, which is a trivial transformation of (8.12.4), is 


(8.12.5) Sng = + ty ge ori | P(e) Flee’) rele de, 
the path of integration being the straight line joining the points 
Ley eee 
Gat ix js Ia tx 


pa 


in the plane of z. 


8.18. When z is small and positive (i.e. when x is near to e?”7#2, on the 
radius to that point), | a! | = er Bale 
is extremely small, and F(2') is practically 1. We may hope to replace 
F(x’) by 1, without serious error, on the whole of &,,. We therefore write 

(8.13.1) Joa = pat Ing 
and 

(8.13.2) P(N) = Bigg = 2pqt+ LI pq = P(n)+ P'(n), 
say, Jp and J; being the integrals obtained from j,, when we replace 
F(a’) by ' 

1, F(x’)-1, 

respectively, Of course P(n) and P’(n) depend upon WN as well as », and 


we have to study their limits when Noo. 


8.14, We prove first that 
(8.14.1) P’(n) 0. 


Wehave W(2){F(a')-=at exp{~5-(e-3)| 3 plv) 2”. 
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Now | e-m#?2a| <1, 
by (8.11.7). Also 

| enl2gzy.'y | = 


7 2pm =\\ 
ie times . ( q 


Qn 1 1 
= exp{- 2 y- ay Rz| seeder, 


by (8.11.8), so that 


co 
e7/12gz > pv) gz” 
1 


s > ptr) ee) = B, 
1 


where B is a constant. Finally, 

jz|#s tn, 
by (8.11.6), and the interval of integration in (8.12.4) is less than 2/qN, 
by (8.11.5). Hence 


; Bes aie 2tB 
| Jog | <2 eae +B = en NY 
a ats o(v-4y7) = o(w- zr 1) = O(N-). 
pad gan 


Combining (8.14.1) and (8.13.2), and remembering that p(n) is inde- 
pendent of V, we see that 


(8.14.2) p(n) = lim P(n). 


8.15. In discussing P(n) we use the formula (8.12.5), with F(x’) replaced 
by 1. Putting 


2=qZ 
we obtain 
(8.15.1) Tyg = Op ge PM Ry gs 
where 
(8.15.2) Kees Bexp|—" 4+ 2n(n—ay) ZB dZ 
Ds A 12¢2Z 24 * 


The path of integration is now the line L of Fig. 2. Thus 


N N 
(8.15.3) p(n) a =lim > Yo, Rye rre = lim YT, 


bh N-wog=1 yp N~>0 g=1 
where 
(8.15.4) T = Dp _Ry ge Pra, 
Pp 


We have now to transform R,,. We apply Cauchy’s theorem to the 
contour indicated in the figure. Here Z is positive at N-®, is —4(— Z)}, 
where (— Z)* is positive, on the lizie (1), and 7(— Z)* on (6). Also 

e<N~ 
and ¢ will be made to tend to 0 before W tends to oo. 
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Cauchy’s theorem gives 


Oe eee Ney eared re, 
Ne?) eee ee al: a Ja Ja Jo Jo 


= 27, +igt(h+Lh+h+4+5+4), 
say, where U, is an integral from —oo to —co enclosing the origin in the 
positive direction, and is independent of p." 
It is plain that 


(8.15.6) Lapses, 
where 
= W 
5.7 = t eee aie 
(8.15.7) V, [, t exp| lag 2a(n #e)t) dt 


Fig. 2. 


(so that V, also is independent of »), when e+ 0. If we assume provisionally 
that the other integrals = J,, I,, and J, can be neglected, then we shall have 


p(n) = lim 3 7, = lim lim $7, 


No g=1 N09 6>0g=1 


= lim lim os {29tU, + igt(L,+ 4) 3B, e-Bnpatia 


No e+ 0g=1 


= lim z 2g! UT, +Y) page ee 


N->og=1 
= lim 23 gLa(T,+%) = 23, oi (U,+%,). 
N->o g=1 
Here I, = L,(n) is defined by (8.7.5). And if we can then prove that 
y a {1 KA, 
(8.15.8) U,+¥, = Bana" =, 


the proof of Rademacher’s identity will be completed. 


* R, depends upon :p as well as on g, because x’ and x” do so; and so do I,, Ig, 14, Z;. 
But U,, V,, and the upper bounds found for J,,..., 2; in § 8.16, are independent 
of p. 
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8.16. We must first verify the provisional assumption made in $8.15. 
It is sufficient to prove that 


lim sa Zlim|h+h+h+h| =0, 


N->og=1 
or that 
(8.16.1) tim 5 alii Max | + hh it+4,| =0. 
In £, Z=-—6-tY (0<¥<x’), 
RIZ) <0, (3) =-app<? 
#) 
and | Bl < (+x 2x’ < (e+ ain) wv 
Plainly J, satisfies the same inequality, and 
(8.16.2) lim | L+4,| $2(¢N)-? 
(for each 9). sas 
In Js, Z=X-in' (-e<X<N-*), 
| Z|t< 24g 4-4, 
by (8.11.6); | e2mn—FdZ | < einai? 
Z = z Fe xs ie a 


by (8.11.5); and, for each p, 
(8.16.3) | | <2N-2. 2tg-EN-+, e2elN* ott < BN Sg teanai?, 


where B is again a constant (in particular, independent of p). Plainly 4 
also satisfies the same inequality, and so 


(8.16.4) lim | + 4| <2 BN—-tge2n7!**, 
e>0 
Thus it is sufficient to prove that 
N 
x g(@W)y #0 
g=1 
N 
and xg. Nig 0; 
g=1 


and each of these is O(N'-*). 


» 
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8.17. It remains only to evaluate the integrals UJ, and Vi, and so prove 

(8.15.8). The integral U, is one of a standard type, and may be calculated 
by formulae to be found in Watson’s Bessel functions. We have 


1 pat) 
Lid (One 
pda ee Sie ag + n(n ~#) 2} az 
oi see dia ee ole 
Tivo Bu “Sale gy) 2B? 
ai ai) 
, + t—‘_}dti, 
we obtain mE al i gs 
2722, _ Hn 
7 2 oe @ on j—2: 
with = 3q2 > = tm / (3) 3 q > 
and this is 
id 1 1 d/l AAs) 
pace E h—), 
Ce pas (0) = Qn wilt ee 
1 d/l KA, 
so that U, = on f8 ali +- cosh — =). 


As for V,, we have 


2 1 
= ey -—= =, 
A {, texp | [aq 2a(n — xz) i dt 


1 d f® 7 
=“ | ——__onin— ; 
Sara t oxp| [og 2n(n ~ Fy) i dé 


-) 
Since ena bet d= Jt entab 
0 2a 


when a and 6 are positive, this gives 


Balt 
pa (==): 


27 f2dn\ A, 
iL Gee eR 
a tle ae Bala hg) 


which is (8.15.8). 


8.18. It is easy to estimate the error involved in taking only N terms of 
the series. The value of N which we choose will depend upon n, and we must 
now use approximations valid uniformly in n. In the first place 


| L,(n) | S¢ 
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by (8.9.2). Also 
d[1_. Ka,\  @(K K%\n—4) 1 
last q ) ae 698 +-.)=0(5) 
if n< Ag’, 


for a fixed A, and q is large (and uniformly in any such range of x). The 
error will involve all values of ¢ greater than N,so that N must be at least 
of order n}. 

This heing so, we have =, (2) = O(g7#) 


for g>N, and xz Lyn) oy(n) = of s r) = O(N-#). 
N+1 N+ 


In particular, if we take NV of order x, we obtain p(n), for large n, with 
an error O(n~t). 

A rather closer analysis is required for numerical computation, hut there 
is no difficulty of principle. Thus Rademacher, using only the crude estimate 
(8.9.2), found that the error after N terms is less than 


= sinh in 

n N 

(with definite values for B, and B,). If we wish to use ¢,(m) instead of 
7,(n), as in the Hardy-Ramanujan formula (and this is more convenient), 
then there is a slight additional error which is easily estimated. Thus 
Rademacher was ahle to show that if n = 721 and N = 21, the error is less 
than 0-88, Since p(n) is an integer, any bound less than }$ is good enough 
to determine p(n). 

A cruder estimate will suffice for special purposes. These calculations, for 

example, were undertaken with a view to deciding whether 

(721)=0 (mod 11%) 
in accordance with one of Ramanujan’s conjectures. Since Ramanujan 
proved that (721) is a multiple of 11°, any hound for the error less than 
60:5 is sufficient for this purpose. 

I have set out in tahular form the values of the first 21 terms of the 
Hardy-Ramanujan formula.‘ The rapidity of the convergence is very 
impressive, and the actual error is much less than that given hy Rade- 
macher’s analysis. The largest value of p(n) which has been found in this 


way is (14031) = 92 85303 04759 09931 69434 85156 67127 
75089 29160 56358 46500 54568 28164 
58081 50403 46756 75123 95895 59113 
47418 88383 22063 43272 91599 91345 
00745, 
* The corresponding table for m = 14031 is too long to print conveniently. 


B,N++ B, 
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recently computed by Lehmer. This requires 62 terms of the series, and a 
more refined estimate of L,(n); the crude inequality (8.9.2) is insufficient. 
Lehmer has, however, shown that 
(8.18.1) | L,(n) | < 29%, 
and this was enough for the purpose. It turns out that 
(14031)=0 (mod 114), 


as predicted by Ramanujan. 


q P{721) 
1 161061755750279601828302117- 84821 
2 —124192062781-96844 
3 — 706763: 61926 
4 2169-16829 
5 0-00000' 
6 14-20724 
7 6°07837 
8 0-18926 
9 0-04914 
10 0-00000" 
11 0-08814 
12 — 0°03525 
13 0-08247 
14 — 0:00687 
16 0-00000" 
16 —0-01133 
17 0-00000" 
18 — 000653 
19 0-00859 
20 0-00000" 
21 ~0-00524 


161061755 750279477635534762-0041 


! These terms vanish identically. 


NOTES ON LECTURE VOI 


§8.1. Most of the material of this lecture is to be found in Hardy and Ramanujan, 
Proc. London Math. Soc. (2), 17 (1918), 75-115 (no. 36 of the Papers), and Rade- 
macher (2). 


§8.3. Hardy and Ramanujan (J.c. 285-287) prove the more precise inequalities 
Hn- ent < p(n) < Kn exant 
by elementary methods. 


§8.4. The Tauberian theorem is proved, in a more general form, by Hardy and 
Ramanujan, Proc. London Math. Soc. (2), 16 (1917), 112-132 (no. 34 of the Papers). 
There are references at the end of this paper to similar theorems of Valiron and earlier 
writers. 
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The ‘Tauberian’ methods are naturally effective, so far as they go, over a wide 
range of problems. Thus we can prove that the number of partitions of x into primes is 


“(alee 


with the same degree of accuracy as in (8.3.1) or (8.3.2). 


§8.6. The functional equations for F(x), required here and later (in particular in 
§8.12), are the formulae for the linear transformation of tho function A(r) of Tannery 
and Molk. See Tannery and Molk, Fonctions elliptiques, ii, 264-267 (tables XLV- 
XLYI). 

The formula (8.6.3) was found independently by Uspensky, Bull. de lacad. des 
setences de VURSS (6), 14 (1920), 199-218. Uspensky’s paper was published a 
little after ours, and we developed the solution much further, so that his proof of 
(8.6.3), which is simpler than ours, has been noticed less than it deserves. 


§8.7. The abstract of no. 36 which appeared in 1917 in the Comptes rendus (no. 31 
of the Papers) does not advance beyond this stage. 

The idea of approximating to an arithmetical function by a ‘singular series’, in 
which each term corresponds to a ‘rational point’ on the circle of convergence of the 
generating series, is one of these which dominate the work of Hardy and Littlewood 
on Waring’s problem. 

Various expressions for w,, have been given by Hermite, Tannery and Molk, 
Hardy and Ramanujan, and Rademacher (1). Rademacher’s is 


Ono = ern, 


q-i 
where 8yq= EE > (2 - [= ]-3}- 
Guar \G qj 2 
D. H. Lehmer (4) has proved that Z,() has a ‘multiplicative’ property which 
enables us to reduce its calculation to cases in which q is a prime or a power of a prime, 


and that in these cases it is the product of g3, a power of 2, @ symbol of quadratic 
residuality, and the cosine of a rational multiple of 7. 


§§8.8-9. MacMahon worked with the recurrence formula 
p(n) —p(n—2)—p(n—3) + p(n—5)+... = 0 
obtained by equating coefficients in the identity 
(L-—2®-a8 405+...) Lp,0" = 1, 


Gupta (1, 2) used additional devices. 

Lehmer calculated p(721), from the Hardy-Ramanujan formula, in his paper 1, 
but the result was first made decisive by Rademacher (2). Both Lehmer and Rade- 
macher also calculated (699), as a test of Ramanujan’s conjecture (§ 6.6) for modulus 
54 (the result being again affirmative). In the meantime Gupta (2) had verified the 
value of (599) by direct computation. 

In his paper 3 Lehmer finds the values of p() for 


n = 1224, 2052, 2474, 14031 
and verifies their divisibility by 


54, 113, 55,114 
respectively. 
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Rademacher and Zuckermann (1) and Zuckermann (3) have found identities for 
the coefficients in other modular functions. Some of these functions are of the same 
type as F(x), while others are of a different type also considered by Hardy and 
Ramanujan [Proc. Royal Soc. (A), 95 (1919), 144-155 (no. 37 of the Papers)]. 

For the divergence of the Hardy-Ramanujan series see Lehmer (2, 4, 5). The first 
of these papers settles the question of divergence, while in the two latter he proves 
more precise results. 

$8.10, For the relevant properties of Farey series see, for example, Hardy and 
Wright, ch. m, or Landau, Voriesungen, i, 98-100. 

§8.17, See Watson, Bessel functions, ch. vi, especially p. 176. 


§8.18. For (8.18.1) see Lehmer (4), 292. 


IX 


THE REPRESENTATION OF NUMBERS 
AS SUMS OF SQUARES 


9.1. The problem of the representation of an integer n as the sum of a 
given number & of integral squares is one of the most celebrated in the 
theory of numbers. Its history may be traced back to Diophantus, but 
begins effectively with Girard’s (or Fermat’s) theorem that a prime 4m+1 
isthe sum of two squares. Almost every arithmetician of note since Fermat 
has contributed to the solution of the problem, and it has its puzzles for us 
still. 

We denote the number of representations of m by k squares, i.e. the 
number of integral solutions of 


P+ot+..4+eR=n, 
by r,(n). We are to pay attention to the sign and order of 2, 2», ..., %,. Thus 
La (£140? = 04 (41), B= (42) (41)? = (41)4 (42% 
and rl) = 4, r9(5) = 8 


The problem is that of determining 7,,(n) in terms of simpler arithmetical 
functions of n, such as the number or the sum of its divisors. It is a good deal 
easier if k is an even number 2s, and I shall suppose this throughout the 
lecture. 


Jacobi solved the problem for 2s = 2, 4, 6 and 8. Thus he proved that 
(9.1.1) rAlm)= 4 (-1)KE-D = Afdy(n)—da(n)}; 


dodd,d|n 
and that 
(9.1.2) r4(n) = 8d = 8e(n), 
a[n 
or 
(9.1.3) v(m) = 24 YY d = 240°%n), 
dodd,din 


according as ” is odd or even. Here the sums extend over all divisors d of n, 
or all odd divisors; d,(m) and d,(n) are the numbers of the divisors of n of 
the forms 4m-+ 1 and 4m + 8 respectively; o(n) is the sum of the divisors of 
n, and o°(n) the sum of its odd divisors. The formula for ,(7) is a little more 


complicated, but of the same general character, and that for 7,(m) will 
occur later. 
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9.2, Jacobi found his ‘formulae from the theory of the clliptic theta- 
functions. If 

(9.2.1) Ha) = 14+ 2e+2rt+..= > am, 
then : 7 

(9.2.2.) 8%) =14 4( ae eee re z +..J 
and 

(9.2.8) (x) = 14 a( 


Ey Qa & 323 i 4x4 ) 
ioe fae leet ee SY? 


and (9.1.1), (9.1.2) and (9.1.3) are the results of equating coefficients. Thus 
the right-hand side of (9.2.2) is (ignoring the 1) 


at sc . 
— 1)Ka- = _- 1) K@-D gid. 
£y (ae a4 BS (1a, 


dodd = dodd i=1 
and the coefficient of x” is 
4 SY (—1}@-, 
dodd, d\n 
In this way the formulae for 7,(n) and r,(2) appear as corollaries of an 
analytical theory. 

It is possible to reverse this procedure, to prove the arithmetical formulae 
directly and deduce the analytical identities; and here there is some differ- 
ence between r,(7} and r,(n). It is easy to deduce (9.1.1) from the theory 
of the Gaussian complex integers, but (9.1.2) and (9.1.3) present greater 
difficulties. For this reason it is interesting to have an elementary deduction 
of (9.2.3) from (9.2.2), and Ramanujan found one. I repeat it here because, 
though it has very little bearing on the substance of my lecture, it is a very 
characteristic specimen of Ramanujan’s work. 

It is easily verified that 


x ee 2x? = B03 dart _ 
l-~a@' l4a? 1-938 '1l+at' 
_ & Qu? 323 5S * 
Loe To 1a beue = LMU, 
: am 
where Ua eae 


and the dash indicatcs the omission of multiples of 4. We have therefore 
to prove that 


(9.2.4) (£4 uy —Ugtug—...)? = A+ gl'mu,. 
Ramanujan proves, more generally, that 
(9.2.5) S=7+7, 
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where S = }cot40+u,sin@+u,sin 26+..., 
T, = (A cot $6)? + u,(1 + uy) cos 6 + Wa(1 + Uy) cos 264+..., 
T, = Hu,(1 —cos @) + 20t9(1 — cos 26) + 3ug(1 ~ cos 30) +...}. 
This reduces to (9.2.4) when 0 = 4a. For then 
S=t+u—uUgtus—..., 
Ti=yet % (-1)" tom(1 + Yon) 


2m 


= f+ (-)" 


ao ao 
Sl — lyn 2mn 
maa) (Teme pe ae 


ner Anan ) 


13 
and Ty = 4 (uy + 8ug + Sug +...) + Quy + 6ugt+ 10uig+..., 
so that T4+T, = p+ he" muy. 
To prove (9.2.5), we write 


wo 3 
S= (: Cot sO + Y Um, sin m8) 
T 
i) -) 2 
= (£cot40)?+ dct $0 > u, sin md + (3 Un, SI md) 
T 1 


= (4cot$0)?+8,+ 8, 


say. We express 8, and S, in the forms 
8,= x {$+ cos0 + cos 20+ ...4.cos (m—1)0+4 cosmo} u,, 
T 


ao co 
8, = 2 iq 8in mo Dw, sin nd =} 


vhs 


z {cos (m—n) O— cos (m+ 2) Ph Up Un, 
1 


and rearrange S,+ 8, as Bs 
8, +8, = YC, coské. 
k=0 


(i) The contribution of 8, to Cy is 4Xu,,, and that of S, is $2u2,. Hence 


= 2B Um(1 + Un) = 2h (l— 2)? = th hnem 
¥ 
1 


(ii) If k> 0, then the contribution of S, to C, is 


0 a 
But DL lin = Het D tase 
k+1 I=1 
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That of 5, is 


a k-1 
= Rae 
LOS tata th Do Untaad mln = Dy Uk — 3 De WME 
“m—n=k n-m=k mtnak T=] l=1 
a ro) k-1 
1 
Hence = 4u,t+ Do Upeat 2 Uther 2 Mitt 
i=1 i = 


It is easily verified that 
Ups L + ty) = Up (ty — Uysrs Mbp y = MeL t+ Up) 


so that 
ke 


oo —1 
p= tab + Style) 25 (1-4 +n.) 
= {b+ Uy t Ugt ... tu — Mk 1) — (Uy tUgt-.. +Up-a)} 

= (1 +u,— 4h). 
Hence finally 


S$? = (Loot 10)24+4 


tis 


Ny, + YU, (1 + uy, — 4h) cos KO 
FT k=1 


= (kcot $0)? + 3 Ug (1+ Up) COS MO + $ Xi MUry| 1—cos m@) 
mss m= 
=T14+%, 


The identity is equivalent to 


ke ai ra er (=) ( ~ tet : ai = cos 


e 


in the ordinary notation of elliptic functions. 


9.3. Jacobi did not attempt to determine r,,(m) when 2s exceeds 8, and 
the first results in this direction, for 2s = 10 and 2s = 12, were found by 
Liouville and Eisenstein. In these cases r,,(m) is not usually expressible as 
a simple “divisor-function”’ of x. For example, in Glaisher’s notation, 


Tyo(%) = B{E,(2) + LOL y(n) + 8x4 (m)}, 


where En)= DY (-1ya, 
dodd, dln 

Bin= YD (~-)Hiewnea 
a@ odd, d\n 


(d@' being n/d, the divisor of n “conjugate” to d), and 
u(r) = 2 YD (a+or)4 
C+h=n 
(a sum extended over the Gaussian complex divisors of n), There are 
similar formulae for 2s = 12, 14, ..., 7,,(n) being in each case the sum of a 
“‘divisor-function” and one or more supplementary functions. The com- 
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plexity of these supplementary functions increases with s, and it is only 
the simplest of them, such as y,(n), which can be defined in an illuminating 
arithmetical way. The majority can only be recognised as coefficients in 
the expansions of modular functions. 

We shall find, however, that there is always a divisor-function which 
‘“‘dominates”’ r.,(7). In all cases 


Togl%) = Joq(M) + n5(%), 


where 6,,() is a divisor-function and e¢,,(”) is much smaller than 6,,(n) 
for large n, so that raq() ~ (2) 
when tends to infinity. 


9.4. I propose to work out two special cases in detail here, 2s = 8 and 
2s = 24. The analysis is a little simpler than usual when 2s is a multiple 
of 8, but these two cases are quite typical of the general theory. In the first 
ease I shall obtain the classical formulae of Jacobi, in the second the most 
characteristic of Ramanujan’s new theorems. I do not think that any 
complete proof of this has been published before. 

I shall not use Jacobi’s or Ramanujan’s methods, but the later “function- 
theoretic” method devised by Mordell and myself, which shows up the 
foundations of the theory much more clearly. I must however begin with 
a few general remarks about Ramanujan’s contributions to the subject, 
which are set out in two substantial papers in the Pransactions of the 
Cambridge Philosophical Society.? 

It is always difficult to say how much Ramanujan owed to other writers, 
and the difficulty is at its maximum when he is developing work which he 
began before he came to England, and when he is concerned, as he is here, 
with some side of the theory of elliptic functions. There is no book on elliptic 
functions which he could have seen in India and which says anything about 
the arithmetical applications of the theory. I believe therefore that Raman- 
ujan had rediscovered Jacobi’s formulae, which certainly lay well within 
his powers. 

By the time he published these papers Ramanujan had read a great deal 
more, and knew all about Jacobi’s and much later work. In particular he 
had read Glaisher’s papers, and treats their content as known; and a reader 
who takes his acknowledgements at their face value will be liable to under- 
estimate his originality. The papers are highly original, whatever deduc- 
tions we may make: they are characteristic of Ramanujan at his best. They 
contain many remarkable theorems which are undeniably new, and con- 
jectures still more remarkable, which were confirmed later by Mordell; and 


t Nos. 18 and 21 of the Papers. 
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the general level of the analysis is astonishingly high. In particular the 
second paper contains all the formal theory of “ Ramanujan’s sum”, which 
is fundamental for my purposes here. I shall have to start from formulae 
of Ramanujan even when I am developing the theory on lines entirely 


different from his. 
. ’ 
Ramanujan’s sum c,{n) 


9.5, Ramanujan’s sum is 
og(ms) = Yersnoni, 
pa) 
where the notation indieates a sum over values of p less than and prime 


to q.! It may also be written as 
2npr 


e,(2) = 3) cos 
P@ q 


(a form which shows that it is real), or again as 
C4(%) = 2pt fy 
where p, is a primitive q-th root of 1. 
It is easy to evaluate c,(m) in terms of the Mébius function p(n). This 
function is defined by 


(i) eG) =1, 
(ii) Bn) = (—1) 
ifn = p, py... p, is the product of v different prime factors, and 
(iii p(n) = 0 
if n has a repeated factor? The chief properties of x(n) are (a) that 
x #(d) = 0 
dia 
for all g>1, and (b) that the identities 
(9.5.1) 9(q) = & f(a) 
a\q 
and : 
(9.5.2) fta) = B.n($) oa) 
dig \a 


are equivalent to one another. The last theorem is usually described as the 
“Mdbius inversion formula”. 
Ramanujan proved that 


9.5.3 = q 
ven am) an la)4 


* Or any complete system of residues prime to g. 
* #(n) has occurred already in Lectures II and IV. See p. 23, footnote. 
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(a sum extended over all common divisors of ¢ and 7). To prove this he 
observes that 


q-1 
7) (n) se bs em 2nhriig 
. h=0 


is g if qg|n and 0 otherwise. But it is plain that 
y(n) = > C4”), 
ala 


and therefore, by the Mobius formula, 


g 
n) = =) q(n), 
cal) = 3 0( 8) nal 
which is (9.5.3). 
There is another proof which is a little longer but depends on principles 
which will be useful later. We say that f(g) is multiplicative if 


(9.5.4) fla?) = fla) f(a’) 


whenever (g,q’) = 1. In particular this involves f(1) = 1. It is plain that, 
if we want to prove that 


(9.5.5) AQ = F(Q); 


and know both f(g) and F(g) to be multiplicative, then it is enough to 
prove the result when q is a power of a prime. 
Now, if (¢,9’) = 1, we have 
Cyt) Coplr) = Sy e-trorla. Sy e-tnn' nie 
pa) py) 
a >» e-tnPrifag’ 

mq), pa") 

where P=q'+n'q; 


and P runs over the range P(gq’) when p and p’ run over p(q) and p’(q’). 


Hence erenPrilid = YY e-2nPritd = 6 (n), 
n(Q), 2a’) Pag’) 


and Ramanujan’s sum is multiplicative. 
Again, if we denote the right-hand side of (9.5.3) by C,(n), we have 


(9.5.6) O(n) G,(n) = 24) (5) dd 3 =4(%) dd’. 


Here dig, d|n, d’|q’, d’|n, and these relations are equivalent to 
dd’|qq', dd’|n 


(since g and q’ are coprime). Hence the right-hand side of (9.5.6) is C,,(n), 
and C,(n) also is multiplicative. 
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We have therefore only to prove that 
Cgi(n) = Cyr(n) 


when @ is prime. The values of p in p(w*) are 


pao atp, 
where z = 0, 1. 2,..., @—1 and p, runs through p,(w*-1), Hence 
wl y 
Ca! (n) —_ > e-2np,7i}o* xs ea tnenife 

pak) z=0 

and the inner sum is w when w |» and zero otherwise. It follows that 
(9.5.7) Coi(n) =o YL eermrtio* = we lia(m) 
plat?) 


if a|n and » = wn,; and that co,(n) = 0 otherwise. 


Now ¢, (2) = os en tnpriie, 
so that i 
(9.5.8) c.(n) =~1 (mtn), cg(n)=w-1 (wln); 
and we can now calculate c,:(”) by (9.5.7) for every &. We find that 
(9.5.9) co:(n) = 0 (orn), —o (aln, wt), w(o~-1) (w*}n), 
and generally 


(9.5.10) 
cain) = 0 (wt 42), —~ok-1 (oI 1a, oF rn), w-N@—-1) (o*[n); 


and we can verify at once that these are also the values of C,:(n). Thus 
¢,(n) = C(n) whenever g = w*, and therefore for all g. 


The series q-*cq(n} 
9.6. Ramanujan summed a large number of series of the form 2a,c,(n). 
The simplest is 
(9.6.1) Uia)= 5 


There are a number of interesting ways of summing this series. It is abso- 
lutely convergent when s> 1, since | c,(n)| $@(n) for every g. 
(i) The shortest way is due to Estermann. We can write c,(n) in the form 
o(m)= al), 
im=a,min 
PAG) 


so that Be yy pli m-s, 
g Im=q,min 
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When we sum with respect to g we remove the restriction on 1, which now 
assumes all positive integral values; and so 


(9.6.2) Uln)= X m(lyremts 
oe SMO _ oraln) _ n-04_4(n) 
7 x a aa 


min ea &s) = &(s) 


where o,() is the sum of the v-th powers of the divisors of n. 


(ii) Ramanujan argued as follows. We suppose that F(x, y) is any function 
of the two variables z and y, and that 
Din) = BF (4,5); 
d|n da 
and define 7,(n) as in § 9.5, so that 7,(n) is v or 0 according as v is or is not 


a divisor of n. Then : 


D(n) = Pee (>, “) 


v 


for any value of ¢ not less than »; and so 
CAL n 
Din) = BF (v2) Beatn. 
ya1lY Vidi» 
Now ¢,(2) occurs in this series when j | v, or v = ju, in which case w $t/j; and 


therefore 


tl n 
(9.6.3.) D(n) = e,(n) 3 Fu, *) 


Suppose in particular that F(x, y) = x21’. Then 


D(n) = pe = 04,5(1) 


9 CAM Hoan) _, 
Uta, ae +735 ae Fioaee 


and o1_,(n) = cal) 3 


Finally, if s>1, and we make to, we obtain (9.6.2). 


(iii) A third proof proceeds on lines like those of the second proof of § 9.5. 
We start from the identity 


$10 ifr 4fO L094 erty 


g=l 


where f(g) is any multiplicative function and w runs through the primes. 
The identity reduces to “Euler’s product” when f(g) = 1. 
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: (7) | Cx?(t) 
In this case Xo = it ae geo 


and we can quote the formulac (9.5.10). Suppose that w* is the highest 
power of w which divides n. Then 


Xo = 1-o* 

ife = 0, and generally 

w-l1 o-1), otl(o-1) ot 

Xo 1 wo an ms beet gs ~~ glarDs 
] — pet) G-s) 
ae (l-a-) T—@l-8 
(1-8) 
c,(n) aay Tp ETE _ 1-37) 

Hence y ro nl (l-o Ir = gs)? 


by the ordinary formula for a sum of powers of divisors of n. 
In particular, when s = 2, we obtain the formula 


—1)" 2costnr 2cosinz , 2(cos2nm+ cos tnt) 
172 i ( 5 2 a 5 5 
a(n) = 47 n(t + 38 + 3 + ze Be spate 
for the sum of the divisors of n. The formula shows in a very striking way 
the oscillations of o(n) about its “average” 47’n. 

We have supposed that s> 1, when our series are absolutely convergent. 
We can write (9.6.2) as 


(9.6.4) 5 Bee 


The first factor on the right is a finite, and therefore absolutely convergent, 
Dirichlet’s series, and the second a Dirichlet’s series convergent for s2 1; 
and therefore, by a familiar theorem on the multiplication of Dirichlet’s 
series, the series on the left is convergent, and (9.6.4) is true, for s=1. 
Putting s = 1, we obtain 

‘Gi ae +o) 


+...=0 


(a theorem of the same depth as the prime number theorem). 
Ramanujan made a number of similar summations, among which 


a) eat) 


€,(7) log 1+ = log 2+ log 3+. ~d(n) 
and mi e,(2) cate) + cal”) —...) = 7,(n) 


are two of the most striking. 
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The series Le,q-%c,(n) 
9.7. The series which is important for our present purpose is not (9.6.1) 
but 


2 ¢(n) 
Bl Vin) = ce 
(9.7.1) (n) on e 
where €,=1 (q=1,3), 0 (¢=2), 2° (¢=0), 


the congruences being to modulus 4. This series may be summed by any 
of the methods of § 9.6; I select the first as the shortest. I define o*(n) by 


oF (n) = a,(n) (n odd), 
o%(n) = o%(n)—o9(n) (nm even), 


o(n) and o%(n) being the sums of the v-th powers of the even and odd 


v 
divisors of n; and I prove that 


nics t 
(9.7.2) Vin) = oH EG) og4(m). 
We have 
Vin)= qe,(n)+2 Yo ge (n) = V(n)+ Rin), 
@=1,3,.. THA, B sere 
say. Here, first, 
Vin) = (my? Sy wm 
a=1,3,... Im=qm|n 
= 5 Mim = ot tn) 
im=1,3,...,m|% 1=1,8,... E 
i of (1) 
= ota) TL (135) = Wass 
Next, 
Kin)=2 DY gt Y pma2 Y p(t) me, 
g=4,8,... Im==q,m|[n Im=4,8,... 


If 4|2, then (2) = 0. Hence we may suppose that either (i) 1 is odd and 
4|m or (ii) I = 21,, where 1, is odd, and 2|m. The terms of type (i) give 


gt ee a, Mot aln) 


inet | | eee)’ 
where the double index indicates a sum over doubly even divisors; and the 
terms of type (ii) give 
Hay) De m8 = Of —s(M) 


4=18,.. FE atmmin (1-27) &(s)’ 
* We have not now two alternative forms like those in (9.6.2), since 


; nigh (n) + of (2) 
when 7 is even. 
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Collecting our results, we find that 
(1-2-8) &(8) Vn) = of_.(m) ~ o4_g(”) + 2ajL,(”); 
and we have only to verify that 
(9.7.3) 03_4(n) — 4.62) + 8a%_,(n) = nI-8o% (7). 
This is obvious if n is odd. If x = 2, where N is odd, and 6 runs through 
the divisors of N, then the left-hand side of (9.7.3) is 
ZSi-s — F(2d)-8 = (1 — 24-8) G18 = (1 — BI-#) N18 YG84 
= (2942-1) nl" Fs41 = n-9{F(20)81 — LY} = 0 oF_ (rn). 
Finally, if 2 = 24, where «> 1, then it is 
LGl-s—{B-s p slag | 4 Qad~o Vpt-s 4 {dts 4 Gls ,,, + Qala} Als 
= {14284 dos... + Qe-Ul-s) _ galls} gis 
= NI{] 4 Qt-s4 gle. + 2le-DG—s)_ gatl—s) Xgs-t 
= ni-s[Qale-D 4 2e-Ws-D | + 28-1 1} Set 
= m-8{g8_,(n)— o8_1(n)} = nt SoF_y(n). 


This completes the proof of (9.7.2), 


The singular series in the problem of 2s squares 

9.8. I must now introduce ideas which are not to be found (at any rate 
explicitly) in Ramanujan’s work.’ They are the ideas from which Littlewood 
and I started in our work on Waring’s problem. 

It is easy to find asymptotic formulae for the behaviour of 

f(s) = 8x) = (1+ 2xe4 204+...) 
where x tends radially to a “rational point” e?”7"2 on the unit circle. We 
may suppose that q = 1, p = 0 or thatg>1,0<p<gqand (p,q) = 1. If 
2 = perprilg 


and r> 1, then a 
Ha) =142 > prt etn®prila 
1 


lath? otatipatia 


li 

pow 

“b 

to 
Mes 


o 


a 
eParila y plats, 
t=0 


eas Tee 


tl 
Pay 


Ifr = e-*, so that S> 0, then 


oO s ao co 
yy Mate = Feats? ~{ ee 5RatN® doe 


1=0 t=O 0 
Sees 1 m\ ala 1\+ 

~ 87g? Jo — For) |p ae fa : 

[revetae = 35 M5) = Fy (eee) 


* Except in our joint work on partitions. 
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and hence ? 
7 a Ves 
(9.8.1) B(x) ~s, (oe) Z 
where 
q 
(9.8.2) Spq = du eh pntia 
jel 


is one of “Gauss’s sums”. If S,, = 0, as happens when g=2 (mod 4), then 
(9.8.1) is to be interpreted as 


1\-? 
Ha) = of(ioe) . 
It follows that 


(9.8.3) jte)~2~*{"22)" (logs). 


r 


We now form an auxiliary function which mimics the behaviour of f(z) 
when x approaches the unit circle in this way. It is known that if 


E(2) = S neta, 
zk 


1\33 
then F(x) —I(s) (tog :) 
is regular at « = 1. Hence, if 


° (8 
Foal) = Tal 


Sea)" River), 
then fnae)~m{22)" (log =) ~Ft0 


when x tends to ¢?”7#, Thus f,,,(z) “mimics” f(x) near this point; and, if 
we write 
02,(%) =l + dFoal®)s 
Py 


then we may expect that @,,(z) will mimic f(x) near all rational points 
epriia, To say this is to say that ©,,(x) mimics f(x) very comprehensively, 
so comprehensively that there should be a very close relation between the 
coefficients of the two functions. If this be so, then the way will be open at 
any rate to an approximate determination of r,,(1). 


+ 7s Sa c\ Bee F 
Now @,,(%) = 1 +76) »» (*2) 2a ienk an 
= 1+ % po(n)x”, 
n=1 
28 
where Pog(t) = 7a} nweIy (*2) e-2nprig, 
Ba 
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We can write this as 


(9.8.4) Pol) = 77m? & Aaln), 
@= 
where 4,(2) = l and 
(9.8.5) A,(n) = 3 Sis,e-2nprila 
pia) 


when g>1. We are entitled to expect a pretty close relation between r2,() 
and p,,(z). It is only an expectation, since our analysis has been entirely 
“heuristic”; but it is plainly one worth pursuing. 

We call (9.8.4) the sengular series. Our construction is typical of that 
of the “singular series”’ in the general Waring problem. 


Summation of the singular series when 23 =0 (mod 8) 
9.9. The singular series can be summed for all s. The analysis is simplest 
when 2s = 0 (mod 8), as I shall suppose. 
The Gaussian sum S,, (or simply S,, if we omit the explicit reference 
to p) can be calculated, for all p, ¢, from the formulae 
Spa = Spa.a Span’: 
S,=1, S,=0, Spy = 24147"), Shower = Qettedrrt, 


= (2) iO Io, Soop = OH, Shaper = WAS. 
Here g and q' are coprime, and @ is an odd prime. The formulae for the 8-th 
powers become much simpler, and we find that 
Sora = a0 
when 2s=0 (mod 8), ¢, being the symbol of §9.7. It follows that 
A, —_ nid 3 en 2npntig = €gY Cy(2), 
(a) 

and that, in the notation of §9.7, 

(9.9.1) po,() = =~ Vin). 
Thus the singular series is (apart from the outside factor) Ramanujan’s 


series (9.7.1), and so 


(9.9.2) cs 


Pal) = Tey FG) 
In particular, if 2s = 8, then 

4 
(1-24) f) = 5, 


(9.9.3) pan) = 1l6oF(n). 


H 
y Io 


O3..1(N). 
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If 2s = 24, then 


B 691 
9-12 = (] — 27-12) 211,772 8 Se Ne 
(1-29) £12) = (1-2-9) BNE, Be = 
and 
(9.9.4) Pogl®) = doy Oh). 


The modular functions 


9.10. We have strong reasons for expecting to find a fairly close resem- 
blance between 7,,(n) and /,,(7), and further analysis does more than 
would be required to justify our hopes, the correspondence being extremely 
close. Indeed when 2s <8 the two functions are identical; in particular 

7_(%) = Pg(m). 
The proof of this depends on arguments of a quite different character, first 
applied to this problem by Mordell. 


9.11, We need the elements of the theory of the modular group and the 
functions associated with it. The modular group I’ has two forms. In the 
homogeneous form it is defined as the group of substitutions 


(9.11.1) 1 = tu, +bW,, 4 = Cw, + dus, 
where a, b, c, d are integers and 
(9.11.2) ad—be = 1. 


In the non-homogeneous form we write 
(9.11.3) T=, 


and the group is defined by the substitutions 
,  ot+dr 
T=—, 
a+br 
We shall use any of the symbols 
a b e+dr 
8, ( ) : (= oe) 
+o denote the substitution (9.11.1) or (9.11.3). J"is generated by the repeated 
application of the two substitutions 


1 o\ (0-1 
117° \1 o 
or 
(9.11.4) r=T+l, rent, 


In what follows we shall always suppose that 
(9.11.5) Br) > 0, 
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so that 

(9.11.6) jaf = [er [ <1 
If7 =utiv, 7 =u'+iv’, then 

ii cet OCIS 
~ (a+ buyr+b2?~ ? 

so that one point in tbe upper half-plane is transformed into another, and 
only such points are relevant. 

We call the region D defined by D 

—hev<d, wWiv'ol 

the fundamental region of I. Each sub- 
stitution of T° transforms D into a curvi- 
linear triangle, whose sides are circles and / 
whose angles are (47, 47, 0). These triangles / 
cover up the half-plane without over- [ 
lapping. 

The fundamental function associated Fig. 3. 
witb the modular group is Klein’s 
“absolute invariant’’ J(r), whicb is defined as follows. We write 


L fa\4 18y2 234 
Yo = Jey, Wo) = (=| (1 +240 patent 2h 


—— 


nee 


at 
1 (7\s 15y2 Dba 
Gs = Gg(Oy, Og) = na(c) (1-504 + +} 


(tbese being the ordinary invariants of the Weierstrassian tbeory), 


1 
A = Alo, wg) = 38-2793 = (Z) w?{(1 a5?) (1 — a4) .. 324, 
1 


_# 
and J(r) = a: 


Then J(r) is a function of 7 only, invariant for the substitutions of I. It 
assumes every value just once in D (wben proper conventions have been 
laid down about the boundary of D), and represents D conformally on the 
whole complex plane (regarded as bounded by the points 0, 1, and oo). 

We call a function which, like J(r), is invariant for J'a modular invariant: 
the phrase “modular function”’ is used more vaguely. Tbe function 

J = SJ(r) 

plays, for modular invariants, tbe part played by z in the ordinary theory of 
one-valued functions f(z). Tbus a modular invariant, with properly restricted 
singularities, is a one-valued function of J. In particular, if a modular 
* Tuse « instead of the usual g, as in Lecture VIII, ¢ being required for other purposes, 


Ia-2 
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invariant is regular and bounded in D, then it is a one-valued function of J 
bounded in the whole plane of J, and accordingly it is constant. 


Functions associated with the sub-group I, 

9.12. We shall be concerned now with functions which are not modular 
invariants in the full sense just defined, but which are invariant, or “all 
but” invariant, for the substitutions of a certain sub-group of I. 

Itis easily verified that the substitutions of which satisfy the congruence 


conditions* a db 10 01 
(° 28(6 ") or e ‘) (mod 2) 


ue D D(T+1) 


Fig. 4. 
form a group, @ sub-group of I’ which we call I. I, is generated by 
(9.12.1) T=74+2, vans 


It has a ‘fundamental region” D, defined by 

-l<u<l, wv?>l); 
and the substitutions of I, transform D, into a system of triangles, all of 
whose angles are 0, which just fill up the half-plane.” 

There is a principal invariant J,(r) of I',, which is related to I, as J(7) 
is to I’ (but whose expression we shall not require). A one-valued function, 
invariant for I, is a one-valued function of J, and a three-valued function 
of J. Finally a function invariant for I’, and regular and bounded in Da, is 
constant. 


t Kither a and d are odd and b and ¢ even, or conversely, 

1 D, may be described roughly as formed by fitting together three regions congruent 
to D (i.e. transforms of D for substitutions of J’). More strictly, it is formed by D 
and one-half of each of the four regions 


Dir+l), D(r—1), 0(-i+ i) o-; os 7 
T 
(the transforms of D by 7’ = 7 +1, ete.). In Fig, 4, D, is bounded by the thicker lines. 
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Proof that ra(m) = pa(”) 
9.18. The functions 
98 = 9% (x) = O8(0,7) = (1+ 2+ 2at+...)8 


and Og = O4(x) = 14+ Z pgm) 2”, 
rl 


where z = e7", are one-valued functions of r. If we can prove that 
(A) 8-80, is invariant for Ps, 
(B) 9-80, is regular and bounded in Dg, 
then it will follow from the general theorem of $9.12 that d-*0, is a constant, 
which is plainly 1; and from this it will follow that 
ra() = palm) = 16o$(n). 


Proof of (A) 
9.14. It is sufficient to prove that 3-80, is invariant for the two sub- 


stitutions 1 
S,(7+2), 8 (-;) 


which generate J',. We can prove, quite generally, that 9-80, is invariant." 
In the first place 
(9.14.1) 95(0, 7 +2) = 8(0,7), 


(9.14.2) on(o, -;| = 799250, 7), 


by the familiar formulae for the linear transformation of the $-functions.’ 
It remains to determine the behaviour of @,,. It is obvious that @,, is 
! We are supposing 2s = 0 (mod 8), but the proof is very much the same in other cases. 
3 We shall require the full table for the funetions 
9,(0,7) = Qt +2ut 42a + ..., 
G,(0, 7) = 14+ 2x4 2Qat+ Qah+..., 
(0,7) = 1~2a+ 2ak—2e9+..., 
which is 
$,(0,7+1) = f80,(0,7), 940, 7+1) =95(0,7), 9,(0,7+1) = 9,(0,7), 


1 
vo -;] =. Lo40, T)s 


I 
v4(o, -;) = Fos0,n, 


l\_ vr 
#,(o, -3) = Renn 


Here ,i = e!* and 7 has its real and imaginary parts positive. The notation is 
Tannery and Molk's, and #,(0, 7) = &(0, 7). 


150 The representation of numbers as sums of squares 


invariant for S,, since x is invariant for S, (and so that 8-*0,, is invariant), 
This will also appest incidentally from the analysis which follows. 

The function F(z) of §9.8 is elementary. In fact, if w = e-¥, so that 
y= —TiT, 


s—2 
Ba) = Dns ta = Yn te = (=) Yne-v 


d\s2 ev gd \s-2 
= (s,) (ent (5) 4 cosech* dy 


= (=) 3 y+ a Sug = y+ ani oa = z an 7) 
Also wen2nrila = enitr—2p/a), 
anaes al as a : Dire T : (2p/q)}*’ 
Foal®) = Figs (~2t) Reet 
Tad Biac-wrih) = 6, 3 Baap ca’ 


Hence 


143) Ox(c) = =1+ 3 pa oa 
0183) Gale) 14 Bode) = 1 3 Beate) a 


where the range of summation is defined by 
q=1,2,3,...3 0<9<q, (p,g)=1; -wo<n<0 
(except that »=0 when g=1). 
We can write (9.14.3) in the form 


9.14.4 
Cras walt pay 


where g = 1, 2,... and pruns through all values (positive or negative) prime 
to q; and this is 


1 2 
9.14.5 O,,=1+ =o + Se TESST 
(9.185) Oxe= 1+ 2% Op—any* gabd,.. par 
1 1 
=1+ a+ 
pte (2p — gr) ont. (p—ary 
1 
=1+ : 
2 @-a 


where now g = 1, 2,... and p runs through all values prime to and of opposite 
parity to q. 
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We wish to remove the restriction that » should be prime to g. We can 
do this by multiplying both sides of (9.14.5) by 
ns) = (1-2-8) (s) = 1438-45-84... 
We thus obtain’ | 


(9.14.6) (8) O25 = W8)+ Doane 


where g = 1, 2, 3, ..., and p runs through all values of opposite parity to q. 
We may also write this in either of the forms 


ra 4 1 = « — 
(9.14.7) (8) 9x5 = —(8)+ Baye (q=0,1,2,...5 p+q=)), 
I 
14 = 42> =1). 
(9.14.8) 9(8) Ozs = 4 (oan! (p+q=l) 


Here the congruences are to modulus 2, and gq, in (9.14.8), runs through all 
integral values. 
If we write (8) Og, = x(7); 
then it follows at once from any of (9.14.5)-(9.14.8) that 
X(t +2) = X(7). 
This, as I pointed out, obvious from the beginning (but a useful check 


on our analysis). 
We use (9.14.8) in investigating the effect of the substitution S,. It gives 


I 1 
~-} = 4 
x “} $ a (pt +4) 
= in ——— = TY(7), 
. * prq=i(p— qr) x(n) 
on replacing p, g by —4, p. 
Thus y(r) is affected by S, and 8, in just the same way as 0’; and 9-@,, 


is invariant for S, and Sy and therefore for I’. 
We have thus proved (A). More generally, we have proved the invariance 
of 8-*@,, whenever 2s=0 (mod 8). 


Proof of (B) 
9.15. The functions }* and @, are defined by power series in x = e7!* 
convergent in the circle | x| <1 or the half-plane «> 0. Also 


Hw) = TT {1 — 2%) (1 + 22"-4)2} 


i) 


* A pair (p,q) of opposite parity is of one of the forms (P, Q), (8P,3Q,...), where P 
and Q are coprime and of opposite parity. 
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has no zeros in the circle. Hence #80, is regular for |z|<1orv>0. Itis 
bounded in any closed part of D; which excludes the two points in which 
D, abuts on the real axis; and therefore it is bounded in D, if it is bounded 
in the neighbourhood of the two points 7 = +1. It is plain that we need 
consider only the point 7 = 1. 

We write 


(9.15.1) r=l-z, T=—— 


and suppose that T->1 inside D3, so that O0<u<1 and u?+2?>1, If 


T= U+iV then ie 5 


= (au) ty? * (a uyee? 


so that 0< U<1 and V-+oo. Hence T > 0 inside Dg, and 


U 


X = e"T+0, 


Now 98(x) = 9§(0,7) = oo, 1 -7) = T#98(0, T) 
= THOX8 4 2X44...)%, 
so that 
(9.15.2) 98~ 256T4X2 


when Too and X—> 0. We require a similar formula for @,. 
We have from (9.14.8) 


4 
(9.15.8) 7 O,=x{r) = x1) 


1 
= 4T42 +g=l1 
Geeagae Ue 
1 
= Og PT 
where now @ runs through all integers and P through all odd integers, If 
[P[T=¢ f=e% = XI, 


1 1 l/d\? 1 
SgePTE Smse ala) @+eF 


1 5) ™ lat = 1 a 2 dop2e2rté 
x cosec' c=—% z) erp 


=o a(S) (e276 + Qesril 4 Ze67HE 4...) 


== Bart(ermis s QB eanit + g8g6rts +... -) 
= Sp XAPl 4 BXAUPL + B9XSFl + ,..), 
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We hare to sum this with respect to P, but we need only the lowest powers 

of X. and X? oceurs only for P = £1. Hence (9.15.3) gives 
4 
a Og~$. 2.40 TAX?, 

or 
(9.13.4) O,~256T4X?. 

Finally (9.15.2) and (9.15.4) show that 3-0, is bounded when 7->1, and 


therefore bounded in Dy. 
It follows that #-®0, is a constant, which must be 1, and thatr,() = p9(m). 


24 squares 
9.16. We proved in $9.14 that 3-*@,, is invariant for I, whenever 
25=0 (mod 8), and this is true in particular when 2s = 24. If 9-*40,, were 
bounded in Dg, it would follow that 9% = O,, and 7.4(") = Po,(n); but this 
is untrue. The correct formula, which was found first by Ramanujan, and 
which I shall proceed to prove, is 


(9.16.1) 9*4(2) = Og,(x) — 23482 g( — x) — 98535 g(x”), 
where 

(9.16.2) g(x) = a{(1 —x) (1—2?) (1-23) ...}4, 
so that g(a?) = w{(1 —2?) (1-24) (1-2) ...}24 = h*4(7), 


in Tannery and Molk’s notation. This last function is, apart from a factor 
of homogeneity, the discriminant A(w,, w). 
We require the formulae for the linear transformation of h(r). These are 


(9.16.3) A(r-+1) = ef? A(7), 

(9.16.4) (-2) = e-¥t Ir h(r). 
We shall also use one other formula, viz. 

(9.16.5) m(7 *) = ett h(r)94(0,7). 


This belongs to the theory of quadratic transformation, but is a simple 
corollary of the product formulae for h(r) and 9,(0, 7). 
The three functions 


(9.16.6) $-4Q,,, $-%Mg(—x), B-Ag(x2) 
are all invariant for I. We have already proved the first invariant. The 
invariance of the third follows from the formulae 


gla?) = B2(r), h®(r-4.2) = br), ne = *| = Thr), 
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Finally g(—2) = no{F*) = ~ m4) 9900,7), 
by (9.16.5), and the invariance of the second of the functions (9.16.6) 
follows from the formulae (9.14.2) and (9.16.4). 

Hence & 4OF, = 0-*4(O,, + ag( —2) + Bg(a?)} 


is invariant, for any « and #. We shall prove that it is possible to choose 
a and # so that -#O#, is bounded in D,. For this, we use the substitution 


(9.15.1) of $9.15, and examine the behaviour of all the functions concerned 
when T-> 00. 


(i) First, by (9.15.2), 
(9.16.7) on(o, 1 -3| ~ QA S, 
(ii) Secondly 


g( —2%) = —h? (7) H2(0,7) = —w(1 - 7} 0¥(0. 1 -3| 


= s( 7) 88(°. z)=(G) eine. 


= TEX{(1—X) (1-24)... e(2Xt4 QNF+ ye 


Hence 
(9.16.8) gl —a) = QUT X44 O(XS). 
(iii) Thirdly 
nt) = 4) = (12) = a) 
= TYA(T) = T2K4(1 — 2) (1—K4)...}%%, 
(9.16.9) g(a?) = TH{X2— 24X44 O(XS)}. 


(iv) Finally we have to determine the behaviour of @,,, which can be 
done by calculations like those of § 9.15, We have now 


1 
(12) Oy = aT De Phe 


1 1 1 /d\0 1 
~ a PHR- are ile! CESS 
_ On) 
ll! 


(2n)2 
11! 


(e2ntt ae Dllesrté SPs :) 


(X2P' 4 QUXAPl4 ..), 
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It is again only the terms with P = +1 that matter, since P = +3 yields 
OLX). Hence we obtain 


(9.16.10) pg ae: 


1X24 QUX44 OKs) 
TA gy PLP 2h zt + OE} 


= 2 Tf x2 ALY X6 ; 
=i {X24 1X44 O(XS)} 


on inserting the value of 4(12). 
We have to choose ¢ and f so that 


(sty 1X44 O(XS)} + 0.204 X44 O(XS)} 
+ B{X*— 24X44 O(X%)} = O( XK). 


It will then follow from (9.16.7), (9.16.8), (9.16.9), and (9.16.10) that 
2-408, is bounded. Equating coefficients, we find that 


33152 36 
a=—Se, B r 


655 

and (9.16.1) follows. F 
The function T(n) 

9.17, We define r(n)' as the coefficient of x” in 


gle) = a{(1~x)(1—2?)...} 4 = Er(n nm) at 
Then, after § 9.9, 
Ong = 14 Dpy(n) a" = 14-8 Doh(n) a. 
1 T 


igs one 3 (- 1)? r(n) a, 


wo 
and g(a) = Sir(}n) a, 
1 


if we agree that 7(y) meaus 0 when y is not an integer. Hence finally we 
ohtain Ramanujan’s formula 


(9.17.1) Toq(®) = doy TT (%) + Coal”), 
where 
(9.17.2) eoy(m) = $28 {(—1)*1 2597(n)— 5127(4n)}. 


9.18. The function r(m) has been defined only as a coefficient, and it 
is natural to ask whether there is any reasonably simple ‘‘arithmetical” 
definition; but none has yet been found. In the next lecture I shall discuss 
some of the most remarkable properties of the function. I must however 


* Tretain Ramanujan’s notation. The collision of ther in r(n) and the f in e" is a little 
unfortunate but is not likely to cause confusion. 
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prove something here about the order of 7(m), since I have to justify my 
assertion, in § 9.8, that 1.,(7) is “dominated” by p2,(7). 
Jt follows from a formula of Jacobi which I have quoted several times 
already that 
Ir(n) a” = af{(1—w) (1-2)... = a(1 ~ 80 + 528 — 7x8 + ...)8, 
the exponents in the series being the triangular numbers. Now (1—3a+...)8 
is majorised by 3 8 
| > (Oni) aino| : 
n=d 
which is of order (1—2z)-* whenz—>1.* Hence 
| r(m) |2®"< 2 | r(n)|a"<A(l—2z), 
where A is a constant, for all n and «. Taking + = 1—n7, when 2” is about 
e~1, we find that 
(9.18.1) r(n) = O(n’). 
On the other hand 
16 mint 2 a(n). 
$1 oh (2) = Pon) = cr ne qe 


and the series is greater than? 


8 224 5 FT 212.8 


1-ga7 Al2 p12 712 ——«g2 Ph 


Hence o%(n) is greater than a constant multiple of n4, and 
16 I 
Poa(n) = aor of(n)(1 oh o(-s)| 


is dominated heavily by its leading term. 
The order of 7(n) is really a good deal smaller than is shown by (9.18.1). 
Ramanujan showed, by a more sophisticated method, that 


(9.18.2) t(n) = O(n"); 
and IJ showed later, by a function-theoretic method, that 
(9.18.3) (n) = O(n). - 


Ishall prove a better result, dueto Rankin, in Lecture X. It is very plausible 
to suppose that (as Ramanujan conjectured) 


t(n) = O(n't+*) 


for every positive ¢; but these questions I must postpone. 


oO 8 
t That of (Xnatn)® or of i teva) , where e-¥ = @. This is that of y- or (1—-2)-*. 
it) 


2 Using the crude inequality | ¢,(n) | Sn. 
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9.19. I conclude by repeating that the results which we have proved 
are typical of those in the general problem of 2s squares. We can always 
express @.,(7) as the sum of a number, fixed by Ramanujan and Mordell, 
of terms defined as modular coefficients; and each of the coefficients gives 
rise to a series of problems resembling those arising from 7(m). In some cases 
it is possible to define them fairly simply in arithmetical terms. In all cases 
the number of representations is dominated by the divisor function p,,(n). 


NOTES ON LECTURE IX 


§9.1. There is a full account of the history of the classical theorems concerning 
representation by two or four squares in Dickson, History, ii, chs. vr and vim. 

Jacobi’s results concerning 2, 4, 6 and 8 squares are quoted by Smith on p. 307 of 
his Report on the theory of numbers (Collected papers, i, 38-364). They are contained 
implicitly in §§40-42 and 65-66 of the Fundamenta nova. Liouville gave formulac 
for 10 and 12 squares in the Journal de math. (2), 11 (1866), 1-8 and 9 (1864), 296-298. 

Gauss, Disqusitiones arithmeticae, § 182, stated a theorem equivalent to (9.1.1). 

Glaisher, Proc. London Math. Soc. (2), 5 (1907), 479-490 (480), gives a systematic 
table of formulae for r,,(n) up to 2s = 18. He had obtained theso formulae in a series 
of papers in vols. 36~39 of the Quarterly Journal of Math. The formulae for 14 and 18 
squares contain functions defined only as the coefficients in certain modular functions 
and not ‘arithmetically’. Ramanujan, in no. 18 of the Papers, continues Glaisher’s 
table up to 2s = 24, and gives a general identity for 0*5(2) which was proved after- 
wards by Mordell (in the first paper mentioned in the note on §9.4). 

Boulyguine gavo general formulae for r,,(n) in which every function which occurs 
has in a sense an arithmetical definition. Thus the formula for r,,(m) contains functions 


of eae Hy RP 2G (ty, Way 014 Lt) y 


where ¢ is a polynomial, ¢ has one of tho values 2s~— 8, 2s~16, ..., and the summation 
extends over all solutions of x3+23+...+2? =n. There are references to Boulyguine’s 
work in Dickson’s History, ii, 317, and the papers of Uspensky quoted below. 

Uspensky has developed the elementary methods which seem to have been used by 
Liouville in a series of papers published in the Bulletin de Acad. des Sciences de 
PURSS and other Russian poriodicals: references will be found in a later paper of 
his in Trans. Amer. Math. Soc. 30 (1928), 385-404. He carries his analysis up to 12 
squares, and states that hismethods enable him to prove Boulyguine’s general formulae. 
They can also be applied to many othor problems concorning roprescntations by 
quadratic forms. 

H. Bessel (Dissertation, Koénigsberg, 1929) has developed Liouville’s methods 
independently, and gives formulae up to 2s = 16. 

Tam not concerned in this lecture with odd values of k, but it may be useful to add 
a short note on the subject. The functions 7,(n), 7;(n), and r,(n) may be expressed as 
finite sums involving symbols of quadratic reciprocity. Thus r,(n) was evaluated in 
this form by Dirichlet, and 7,(n) and r,(n) by Eisenstein, Smith, and Minkowski. 
When & = 3 the problem is, as had been shown long before by Gauss, much the same 
as that of finding the number of classes of binary quadratic forms of determinant —n. 

The results do not seem to have been worked out so systematically as those for 
even k, and it is not possible to refer to a comprehensive statement of them, though 
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many formulae can be found in chs. vir and rx of the second volume of Dickson’s 
History and ch. x of Bachmann’s Die Arithmeitk von quadratischen Formen (I Abthei- 
lung). 

‘Thhove are two completely different solutions of the 5 and 7 square problems, the 
‘arithmetic’ solution of Minkowski and Smith and the ‘function-theoretic’ solution 
of Hardy and Mordell. Bachmann gives an account of the first and Dickson, Studies 
in the theory of numbers, ch. ximz, of the second. References to the work of Hardy and 
Mordell are given in the note to §9.4. 

The formulae are generally stated, when £is odd, in terms of primitive representation 
(in which 2,22, ...,%, have no common factor). The simplest formulae are due to 
Dirichlet and Eisenstein; thus the number of primitive representations of an odd n 


by 3 squares is - 5 
243 (:) (n=l), 8 & (;) (n=3). 
‘ gSin re 


ssin 
Here ( = is Jacobi’s generalisation of Legendre’s symbol, and the congrucncas are to 
n 


modulus 4. Eisenstein’s formulae for § and 7 squares are proved by Bachmann. For 
references sce Dickson, History, ii, 263 and 305. 


89.2. The formulae (9.2.2) and (9.2.3) occur in Gauss’s posthumous work. Sce the 
references given by Dickson, History, ii, 283. 

There is a proof of (9.1.1), by means of Gaussian integers, in Hardy and Wright, 
240-242; and one of (9.1.2) and (9.1.3), by means of integral quaternions, in Dickson, 
Algebren und ihre Zahlentheorie, Kap. tx (see in particular 181-182, Satz 22), Landau, 
Vorlesungen, i, 110-113, gives an elementary deduction of (9.1.2) and (9.1.3) from 
(9.1.1). 

Ramanujan’s proof of (9.2.3) occurs in no. 18 of the Papers; the formula is (17) on 
p. 189. The proof is reproduced by Hardy and Wright, 311-314. Ramanujan seems to 
have used similar arguments frequently; Watson (10) points out that the familiar 
formes en’ut+sn2u=1, dn®*ut+k sn?u = 1 
appear, in the note-books, as identities between g-series to be proved by elementary 
calculation. Formula (18) on p. 139 of the Papers may be reduced to the familiar form 


(2"(u) = 8¢(u) — 490. 


Thus Ramanujan deduces the differential equation satisfied by (x), by direct algebra, 
froma its expansion as a trigonometrical series. 


§9.3. The formulae for 7,9(7) is Liouville’s: see the note on § 9.1. 
It may happen that e,,() = 0 for special forms of n. Thus 


exo(r) = 32 y(n) = 0 


ifm is not a sum of two squares, and e,,(n) = 0 ifn is even. These results, due to Liou- 
ville, were rediscovered by Glaisher. 

In order that es,(7) = 0 for all n, it is necessary and sufficient that 2s<8. In these 
cases only 7,,(7) is a ‘divisor function’, represented by the ‘singular series’ of § 9-8. 
The ‘reasons’ for this have been shown in a new light recently by Siegel, Annals of 
Math. (2), 36 (1935), 527-606. 

The theory of the representation of n by the special form 


(1) tot... +22 


can be extended to general definite quadratic forms in & variables. There are a certain 
number of genera of forms of a given discriminant, each containing a certain number 
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quoted at the end of the section. Suppose that f(z) is regular and bounded for 4(7) > 0, 


and that 1 
fT+1) =F(0), (-7) = f(r). 


Then g(x) = g(e!7) = f(7) cannot have an essential singularity at the origin (since 
then it would assume arbitrarily large values near the origin), nor a pole (since then it 
would tend to infinity); and so it is regular at the origin. We may therefore suppose 
(subtracting a constant if necessary) that (7) > 0 when §(7) + 00. 

Now | f(7) | attains its upper bound M in D at a point on the boundary of D (and not 
at infinity), so that there is a finite 7, on the boundary of D for which | f(7,)| = M. 
If f(r) were not constant, there would be a 7, near 7 for which | f(7,)|>| f(t») | = M. 
But there is a 7, in D for which f(7,) = f(7,), and so [ f(72) |> 24; a contradiction, 


. 


§§ 9.1415. The proof follows the second paper of Hardy quoted in the note on § 9.4, 


§9.17. (9.17.2) is formula (148) on p. 159 of the Papers. There is an error of sign 
in formula 7 of Table VI: Ramanujan seems to forget momentarily that 9(~ x) begins 
with —z and not x. In any case, as I remarked in the note on § 9:3, his 7,() and e,,(n) 
are half mine. 


§9-18. Ramanujan’s proof of (9.18.2) is to be found (as a special case of more general 
theorems) in the Papers, 146-148 and 153, or 160; and Hardy’s proof of (9.18.3), 
which is reproduced in a slightly different form in the next lecture (§ 10.8), in Hardy (9). 


xX 
RAMANUJAN’S FUNCTION 7(n) 


10.1. I proved in Lecture IX that 


16 
691 


where o7,(n) is a simple “divisor function” of x, and 7(m) is defined by 


(10.1.1) To4(t) = Zo Oh (™) + a (- 1 2597(n) — 5127(30)}, 


(10.1.2) g(a) = 2{(1—2) (1-2)... Pt = ut 


I shall devote this lecture to a more intensive ae of some of the properties 
of 7(n), which are very remarkable and still very imperfectly understood. 
We may seem to be straying into one of the backwaters of mathematics, but 
the genesis of 7(n) as a coefficient in so fundamental a function compels 
us to treat it with respect. 


The multiplicative property of T(n) 
40.2. The coefficients in many modular expansions have simple arith- 
metical meanings; thus the coefficient in 
H(t) = (1+ 2u4 2at+...)° 
isr,(n). But 7(n) has no such obvious interpretation,’ and its arithmetical 
properties are still very obscure. 
Ramanujan conjectured that 


(10.2.1) t(nn’) = 7(n)7(n’) 
if (n,n') = 1, ie. that r(n) is multiplicative; and this was proved a. little 
later by Mordell. Mordell’s proof is very instructive, and sufficiently simple 
for insertion here. It depends on the identity 


oa ao a 
(10.2.2) Lr( pn) a” = 7(p)  7(n) 2" — pt ¥ r(n) a”, 
1 1 1 
where g is prime. 
1 Jacobi’s identity shows that 
oo 
Er(n) ve = av(1—3e+ 5a3— Tab + ...)8. 
1 
Hence T(r) = L(— 1m tnar-+n9(2n, + 1) (2g +1)... (2g + 1), 
the summation being extended over all representations of n—1 as a sum 
8 8 
2 oa +1)= D4 
t=1 t=1 


of 8 triangular numbers #,; but this interpretation is not illuminating. 


HR Ir 
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We write, as usual 


(10.2.3) Aono) = (=) a°{(1 2%) (1a) ...}4, 
1 


where x = e"7, 7 = We/w,. Then A(w, w,) is invariant for the substitutions 


1 90 0-1 
a(t a) (ro) 
which replace 1, W, by w,, +2, and by —w,, @, respectively. We prove 
first that 
p-1 p-l 
(10.2.4) P= A(w,, pw2)+ DY A(pa,, Kw, + Og) = Ug t+ x, Vg 
«= c= 
is invariant for S, and S,, and therefore for all the substitutions of the 
modular group I. 
First, S, leaves u, unaltered, and permutes the v,, so that P is invariant 


for S,. 
Next, 8, changes P into 


p-l 


A(—g, BO1) + 4(— pore, 0) + iA (—Pw, 0, ~ KW) 


p-1 
= A( pin, Oe) + Aw, pre) + z A(— pa, Wy — KW); 


and, in order to prove P invariant, it is enough to show that 
(10.2.5) A(— pw, @1— KW) = A(pay, Kk’, + We), 


where x’ runs through the values 1, 2, .... 9—1 with x, or through these 
residues (mod 7). 


We take a@=—-Kk, b=, 
and determine c and d so that 
ad-—be = ~kd—pe = 1. 
Then «’ = d runs through the residues required. Also 
— api, +5(W,—Kw,) = p04, ~CpW, +d(w,—Kw,) = KW, + W., 
and so A(— pws, W,— KW,) = A(pw,, KW, +), 


which is (10.2.5). 
It follows that P is ievarant for I’, and therefore that 


P 


(10.2.6) Q= Ras 


is invariant. 


Ramanujan’s function 7(n) 163 


Now 
(10.2.7) 
7 12 «2 Sn T 12 © ‘ 
Aton) =(Z) Erim2%, (04, poy) = (FE) Sremaee, 
and 
p-l T 12 @ p-l 3 
(10.2.8) 3 A(pey, Kwy + 2) = (=) T(n) x2rP Y eknenifp 
x=0 Pw, n=1 c=0 
zy \12 oO 
= {— T( pn) v?R, 
(3) 2 Erm) 


since the sum with respect to « gives » if p|n and 0 otherwise. Hence the 
expansion of P in powers of » begins with 


(Z)"o™ 7(p) a, 


wy, 


while that of A(w,, #2) begins with 


a \he 
@, . 
Tt follows that Q = P/A is bounded in D, and therefore constant, so that 


(10.2.9) P=pr(p) A. 


Finally, substituting from (10.2.7)-(10.2.9) into (10.2.4), we obtain (10.2.2), 
with 2? in place of x. 
We can now prove (10.2.1). It is sufficient to prove that 


(10.2.10) T(p'n) = T(p*)7(m) 
for every A, » being prime and (”, ») = 1. This is obvious when A = 0. If 
we equate the coefficients of x” in (10.2.2), we obtain 
T(pn) = T(p)7(n), 


which is (10.2.10) for A = 1. If we equate those of x?*~'", where A> 1, we 
obtain 


(10.2.11) T( pan) = 7(p) T( pa) — pr(pi-*n), 
and in particular 

(232) 7(p) = 7(p) 7(p*)—pur(p*). 
Hoag uy = rnp) —r(n) 7p"), 
we have Uy = T(P) Uy P Una 


But uy = 0 whenA = 0 and A = 1, and therefore u, ~- 0 for all A. 
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7(2) 
ne 


The function F(s) =2 


10.3. From (10.2.1) we can deduce a remarkable formula for the Dirichlet’s 
series 


(10.3.1) Fs) = 5. 
1 
I proved in §9.18 that t(n) = O(n), 


so that (10.3.1) is absolutely convergent for o = 9t(s) > 9. We shall see later 

that much more than this is true, but this imperfect result is sufficient to 

show that the transformations which follow are valid for sufficiently large c. 
Since 7(m) is multiplicative, we have 


(10.3.2) Fs) = nl Xps 
where 

2 

(10.3.3) Xp = 1472). 


We can calculate 7(p*) in terms of 7(p) from (10.2.11), and so determine y,. 
We write 


(10.3.4) cos 6, = 4p-* 7(p); 
(10.3.3) dy = pPAr(p). 
Then (10.2.11) gives  @,—2cos@,d).. +a). = 0. 
sind sin 20, 
But =l= P = aa P. 
ru a= ain,’ a, = 2c0s6, sin 8, ° 
and it follows by induction that 
sin(A+1) 6, 
a4, = —, 
sin 0, 
and 
10.3.6 ay oe pa tin At 1) 4, 
(0.0) a ae oT 
Hence 
2 i 
10.3.7 = CF -9A gi I = 
( ) & sind, Zo? a Ee, 1— 2p" $086, +p 
= 1 
ed ie areal 
and 
(10.3.8) F(s) = = I 
. TX i 1—17(p) ps + pis 


This is the analogue of Euler’s product for &(s). The series and product are 
absolutely convergent for sufficiently large o, 


Ramanujan’s function 7(n) 165 
From (10.2.1) and (10.3.6) it follows that 


(10.3.9) a(n) = nt a 
ifn = Hp. 

The function F'(s) also satisfies a functional equation of the same type as 
that satisfied by ¢(s). It will be convenient to defer the proof of this to 
§10.9. 

Congruence properties of r(1) 

10.4. The results of §10.2 enable us to prove a number of curious arith- 
metical properties of 7(n). These have a certain resemblance to those of 
p(n), which I spoke of in an earlier lecture; but they are naturally more 
numerous, since 7(n) is multiplicative. 

Suppose that p is prime and that 


(10.4.1) T(p)=0 (modyp). 
Then (10.2.12) and (10.2.10) show that 
(10.4.2) T(pn)=0 (mod p) 


for every n. Ramanujan tabulated 7(n) up to » = 30, and his table shows 
that (10.4.1) is true for 


(10.4.3) p = 2,3,5,7, 23; 


so that these primes have the property (10.4.2). 
The last two primes yield further properties of r(n), which do not depend 
upon the multiplicative property. We can write g(a) in the form 


g(v) = Xr(n) a” = a T(1—a*)2 171 —2*)8, 


Now (l—a®)’=1—27" (mod7), 
(1—a")7) = 1 ~— 327 + 3atn—y21n (mod 7), 
and (1 aryl = Ze ,,%", 


c, being divisible by 7 whenever y is not. Also 
TI —a)? = 5(—1y (29+ Daher, 

by Jacobi’s identity. Hence 

Ar(n)a” = ZE(— 1) (2v+ lye,aeer rt 
and a(n) = 2(—1) (2v+1)e,, 
the summation being over all » and v for which 

n= dvv+))+ye+l. 
Now 4r(v+1)=0,1,38,0r6 (mod7), 

so that ney, 2+1, w+2, orw+4 (mod7). 
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lfn=3B, 5, or 6, 4 cannot be a multiple of 7, and then ¢, is a multiple of 7. 
Hence t(7m+h)=0 (mod7) 
for k = 3, 5, 6 as well as for & = 0. 
We can prove similarly (using Euler’s identity instead of Jacobi’s) that 
7(23m+k)=0 (mod 23) 


when k is any quadratic non-residue of 23. 


10.5. The congruences of § 10.4 are satisfied by all » of certain arithmetical 
progressions. There are also congruences satisfied by “almost all” 2, 
For example 

(10.5.1) t(w)=0 (mod 5) 
for almost all 1 (in the sense of § 3.4), 

We begin by proving that 

(10.5.2) t(n)=no(n) (modd), 
where o(n) is the sum of the divisors of n, for all x. This depends on two 
identities in the theory of the modular functions, viz. 


(10.5.3) Q3 — R® = 17289(2), 
and 
~ nar 
(10.5.4) Q—Pi = 2882, 
where 
2x2 ues 
(10.5.5) Pa 1-252 Wty tio ), 
Q3y2 —-B3z8 
10.5.6 =1 
(10.5.6) Q +240( + eee ) 


25y2 35x83 
10.5.7 R= } 
(10.5.7) 1 sou( Pee ae eee 2: 
The identity (10.5.3) is familiar, but I have not seen (10.5.4) anywhere 
except in Ramanujan’s work. 
We can deduce (10.5.4) from (9.2.5). We have 


1 6 8 
fcotdé = = —-— —- —— + 
26 24 1440 7? 
1 1 i 
Tent LO eas cee tee 
(f cot 38) sat get 
ujsn6 wen 204...2-2— 1p Q- 1 os 


24 Tag 
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20,(1 + th) Cos 6 + tg(1 + tg) cos 26+... 


Le ates lj 222 
= tat ace sl oe tae acai = 
}{u,(1 — cos ) + 2u9(1 —cos 26) +...} = — 624. 


Q ge Brtals s Bde x “ 
(5-9 ~ 1440 ~ 46% 24" (l—aye" (1 a2y2 


Q oil a D2Pqy2 ; ; 
+] 9607 SiG aye? Gage a +s 
and (10.5.4) follows by equating the coefficients of 6%.* 

It is now easy to prove (10.5.2). For n>=mn (mod 5) for all n, and so 


R=21P=P (mod5), 


while Q=1 (mod5). 
Hence 1728 L7(n) a" = Q— R2=Q—P* (mod 5). 
roa) pray oO © 
But Q—P* = 288 Pa era: 8S wee 
1(1— eye #= level 
= 288 5 no(n) x, 
1 
so that T(n)=6r(n)=no(n) (mod 5). 


In order to prove (10.5.1), then, it is sufficient to prove that 
(10.5.8) a(n)=0 (mod 8) 
for almost all x. 


10.6. The congruence (10.5.8) expresses a special case of the more general 
theorem that 


(10.6.1) o(n)=0 (modk) 
for every & and almost all n.? We may plainly suppose that & is prime. 
We denote a general prime by p and a prime of the special form km—1 
by a, and write 
y n= i pr Ie. 
pu 


* Equating the constant terms gives 
a 
P=1-24 a omateets 
aca ~—ay? = ay? oo | 
which is easily verified directly. 
? Still more generaily, c,(n)=0 (mod &) for all odd s, all k, and almost all x. 
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ptt—] gett) 


Then oe) My res men ee 
Tf « is odd then e 
ott — 1 met 
mot =(ot+1) gic] =0 (modh), 


since w7+1=0 (modk) and the second factor is integral. It follows that 
(10.6.1) is true unless every special prime occurs in » with an even exponent 
a. Itis therefore sufficient to prove that a is odd, for at least one w, in almost 
all n. 


We define b, by 
b,=1 (if all a are even), 
56, = 0 (otherwise); 
and we have to prove that 
(10.6.2) Bin) = ¥ 6, = o(x). 
nNSx 
We can in fact prove more, viz. that 
Cx 
(10.6.3) Biz) ~ ioe 


where x = 1/(k—1) and C depends on k only. The proof is very much like 
Landau’s proof quoted in § 4.5-4.6. 
b 


Let Fis) = 2-4. 
Then F(s)= TI fee ce 1 Ae 
)= 11 pipet 1 tat Gite 


= 1 1 . __ $8) 
= Tip Daas * Tate) 
for o = i(s)>1. Hence 
log F(s) = log &(s)~ G(s), 


where G(s) = Llog (l+a-’) = Lo + Rs), 


and R(s) is regular for o>}. 

I must now take for granted some of the standard theory of the primes 
of an arithmetical progression. The function X'p-* behaves, in the ordinary 
theory of primes, “sufficiently like” log &(s); and Yw- behaves, in the theory 
of the primes of the arithmetical progression, sufficiently like «log &s). 
Hence log F(s) behaves sufficiently like (1~«) log (s), and F(s) sufficiently 


like {£(s)}**. In fact F(s) = {E(s)}"-* H(s), 
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where H(s) is regular at s = 1 and does not behave too badly for large s 


whose real part is nearly 1. 

If we accept all this, then the road to (10.6.3) is clear, since we have only 
to repeat Landau’s argument with comparatively trivial changes; and we 
have seen that (10.6.1) is a corollary. Here & is arbitrary; but the passage 


to (10.5.1) depends upon the special properties of 5. 
There are similar congruences for other moduli, of which the most note- 


worthy is 691. Ramanujan proved that 
(10.6.4) 7(n)=oy,(n) (mod 691); 
and Watson has deduced? that (as Ramanujan conjectured) r(x) is divisible 


by 691 for almost all ». 
I may observe that (10.6.4), for odd n, follows from (10.1.1). For then 


167(n) = — 128.2597(n) = 160,,(n) — 69175,(m) = 1604;(m) (mod 691). 


The order of r(n) 


10.7, L return to the problem of the order of 7(n), which I referred to at 
the end of Lecture IX. This is the most fundamental of the unsolved 
problems presented by the function. 

Ramanujan conjectured that 

(10.7.1) 7(p) $2p* 
for every prime p, or, what is the same thing, that all the angles , of § 10.3 
are real. I shall call this the “Ramanujan bypothesis”’. If the hypothesis 
is true then 


(10.7.2) [r(m)| = 0 TT sl) | Sn* [TT (A+) =n da), 
and so 
(10.7.3) (n) = O(n **) 
for every positive e. It is easy to prove, in the other direction, (i) that 
(10.7.4) a(n) zn 


* Actually, when k = 5, 


F(s) = &ls mae ay ie), 


where A(s) is regular for o>}, and L,(s), L,(s), L4(s) are the three Dirichlet’s L- 
functions associated with the characters 
d,4,-4,-1), G,-44,-1) (,-1,-1 1). 
> Using the theorem referred to in footnote 2, p. 167. 
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for an infinity of n, so that the 44 of (10.7.3) cannot be replaced by any 
smaller number; and (ii) that the truth of (10.7.3), for every positive n, 
involves the truth of the Ramanujan hypothesis.* 

We saw in § 9.18 that 


(10.7.5) t(n) = O(n). 
Ramanujan gave a more complicated proof of 
(10.7.6) t(n) = O(n’), 


and this is the most that has been proved by ‘‘elementary’’ methods. I 
proved in 1918, by the method used by Littlewood and myself in our work 
on Waring’s problem, that 


(10.7.7) t(n) = O(n’). 
Kloosterman proved in 1927 that 
(10.7.8) r(n) = O(n? *) 
for every positive e; Davenport and Salié proved independently in 1933 that 
(10.7.9) r(n) = O(n’? +); 
and finally Rankin proved in 1939 that 
(10.7.10) t(n) = O(n”), 


the best result yet known. The indices here are (apart from the e’s) less 
than 6 by }, 4, and } respectively. 


Proof that r(n) = O(n) 
10.8. I prove (10.7.7) by showing that 
(10.8.1) g(z) = A(l—z) (1-2)... = area 


uniformly in @. Here z = re and 0<r<1. Assuming this for the moment, 
we have lf gz) 


T(n) = 


~~ Qari cent *, 


? For (i) take n = p4, and observe that 
| sin (A+1) 0, 


> 
sin 6, 21 


for an infinity of A. For (1i) take the same n and suppose 6, complex. Then 
8, = kr £i,, where 7, is positive (since cos 8, is real), and 
sin(A+1)0,! _ sinh(A+1) %, 
sné, | sinha, 
is greater than a constant multiple of et» or né, where 
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where C is the circle r = e~¥". On this circle | z-"| = e, and therefore 
t(n) = of Max | g(re**) i = O(n), 
Poeew tin 

so that (10.7.7) is a corollary of (10.8.1). 

We divide up C, by the Farey dissection’ of order 

p=([Jnjt+l, 

into arcs &,,. It is sufficient to prove that (10.8.1) is true on &,,, with 
uniformity in p and q. 

If z= x? = e777, then 


f(z) = W4(7), 


in Tannery and Molk’s notation; and 


(10.8.2) h4(T) = (a+ br)!2h?4(7) 
. c+dr 
s D> pte 


is any substitution of the modular group I’. We take 


a=p, b=—4 onde, d=—p’, 


where p’ is defined by 1+pp’=0 (modq). 


2 
We write gee tg, Pst 
gq q 4g 
Then 
(10.8.8) z= ent — exp( — 2 27) 
q g 
sh SoG PN ie. 
q q 
so that 
a at oe 
(10.8.4) b=5- ( ig). 
ct+tdr_ d 1 py 1 pp 4 
Als = = = 
7 ator 6 ba+br) ga(p—ar) qa’ 
and 
Qn Qp'ni 
10.8.5 Z = eitT — ex (-F+ ). 
( Miede1@ 
Finally, (10.8.2), when stated in terms of g(z) and 9(Z), is 
(10.8.6) g(Z) = £3%9(2). 


* See §8.10. 
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Now 
] 
(10.8.7) [Z| =e P8T) = exp(- anit “| 
g, 
= |-ar | 
~ EE FE PY 
Qn 4n® An? 
Also lols Pp? <= <> 
on byq and gn < vn < 2n~-1, Hence there are constants A >0 and é<1] 
such that |Z|<e4=8 
on £9; and 
(10.8.8) |9(Z) | $| |Z | rn) || Z[*< Bl ZI, 


where B is another constant. 
It follows from (10.8.4), (10.8.6), (10.8.7) and (10.8.8) that 


| 9(z)| = ||? | 9(Z) | < Bl Si? | Z| 
2 B = | ne 4n?n-1 
nes GPP | ges BF) 
no 
b= TGt pA 
Since x5e~47*4 ig bounded for positive ys, it follows that 
gz) = O(n8) = O{(1—-r)} 
uniformly in p and q. This proves (10.8.1), first on the circle r = e-¥, and 


then, by the maximum modulus principle, generally." 
We have thus proved (10.7.7), and indeed a little more. We have 


= Brsyse-42, 


where 


Zr(m) rem = xl, | g (re!) |2d@ = O{(1 —r)*2} 


and a fortiort z 72m) 7?" = O{(1—r)-?} = O(n®). 
1 
Also yam = en emin = e-2 
if mSn, so that 
(10.8.9) S1(m) = O(n®). 


1 


This includes (10.7.7), and shows that 7(n) is O(n*") “in mean square”. 


* It would be sufficient that (10.8.1) should be true when r = e-1", 
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Further properties of F(s) = 2 om) 
10.9. Rankin’s proof of (10.7.10) depends upon a functional equation 


satisfied by 


(10.9.1) Fre al) 


ns * 
The simpler function F(s) of §10.3 also satisfies a functional equation, and 
it will be convenient to investigate this here. 

We saw in $10.3 that 


1 
(10.9.2) Fis) i zn me 1—7(p) p+ pu-*s 


for sufficiently large o. It follows from (10.8.9) and Cauchy’s inequality that 
n 
ZI 710) | = On) 


and that the series and product are absolutely convergent for o >43. The 
abscissa of non-absolute convergence depends upon the order of the sum 
function 


(10.9.3) T(x) = ¥ r(n). 
Now re) Fe)=["y-eyay,  * 
0 


where g(x) is the function (10.1.2), for 7 >423.7 But 


gle) = (Fy aterm) # 


tends exponentially to zero both when y->0 and when y->oo, so that the 
integral is convergent for all s, and F(s) is an integral function of s. Also 


Is) F(s) = enys {" ye ge-*7) dy 
qa 


- (enya f *att-ag(e~ dz = (27) (12-8) F(12-s), 
Q 


so that J'(s) satisfies 


(10.9.4) (277)-8 Fs) F(s) = (2n-2P(12~—s) F(12—s). 
1 When Ei r(n) [[yeternsdy = roy ze) eee. 
a 


* This is the special case g = 1, p = 0 of (10.8.6). 
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The behaviour of F(s) is “trivial’’ in the half-planeofabsolute convergence; 
in particular, it has no zeros whose real part exceeds 4%. The functional 
equation then defines its behaviour for o <4}. It has “trivial” zerog for 
$s = 0, —1, —2,..., but no other zeros whose real part is less than 1}. 

The “critical strip’, corresponding to the strip OSo7S1 for &s), is 
1L<o0<48; and there are problems concerning its behaviour in this strip 
corresponding to all the familiar problems about €(s). It has been shown, 
for example, that there are no zeros on the lines ¢ = 4 and ¢ = 48, and an 
infinity ono = 6. The “Riemann hypothesis’’ for #'(s) is that all the zeros, 
other than the “trivial” zeros, lie on 7 = 6. 

There is another generating function which may be made useful in the 
study of r(n), viz. 


(10.9.5) S(s) = Br(n) evn, 
It is easy to prove that 
(10.9.6) %(s) = (2a)? TG) 8% ran B 


but the properties of %(s) are not so interesting as those of F(s). 


Properties of f(s) = yn) 


ns 


10.10. We have now to investigate the corresponding properties of the 
function f(s) of (10.9.1); and we begin by finding an integral representation 
of f(s), valid for o > 12, viz. 


(10.10.1)  2(42r)-* I'(s) &(s — 22) f(s) ={[ | A(r) |? £(s, 2, y) dady. 


Here Tear, y>9, 
A(r) = ertrf(] pat e2niry (i pe e4ntr) - Pe 


1 


(10.10.2) &(s) = E(s,,y) = Bal orrerry ra 


when the dash has its usual meaning, and D is the fundamental region 


—$Sas}, |[wtiy|21 
of the modular group. 
It follows from (10.8.9) that the series for f(s) is absolutely convergent 
when ¢ > 12. From 


A (r) ee > T(n) ernriz—anay 
1 


* So that A(r) = g(e?7'7) in the notation of §10.1. 
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it follows, by Parseval’s theorem, that 
+ Ea) 
[i l4enae = Eeteeom, 
-+ 1 
Further, if we suppose 7 > 12, we have 


(10.10.38) (4n)*I'(s)f(6) = 7%(n) | "plete ii 
=["r(z Tn) etmn) dy 
0 1 


= [yay]! ac) [2 dae ={{ el Ar) |°dedy, 


where S is the strip —is2xsh, y>0. 
Since A(r) = O(e-277) 
for large y, and. A(r) = Oly-8) 


for small y,* uniformly in %, the integral 


Ff Alar aes 


is convergent for o > 12, and this justifies the transformations. 


10.11. We call a modular substitution 
—c¢+ar' 
d—br’’ 
or T, a substitution 7, ifit transforms points 7’ of the fundamental region D 
into points 7 of a triangle D, lying in 8. Since a, b,c, dand —a, —b, —c, -d 
give the same substitution, we may suppose that 
(10.11.2) bS0, d>0ifb=0, (6,d)=1. 


Any T transforms D into a triangle lying in a strip 


(10.1.1) t= T(r) = 


n-tsasnth, 
and then T=T(r')—n 
is a Ts; so that there is a Ty corresponding to any pair (b, d) which satisfies 
(10.11.1). Further, if (b,d) is such a pair, and (a9,¢)) a pair which, with 
(6,4), gives a Ts, then the general solution of 


ad—be=1 
is a=a+nb, c=e,+nd; 
and then  _ ~eotMeT’ | 


« By (10.8.1). 
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gives a Dr outside S except when » = 0. Hence there is just one Zs corre- 
sponding to any pair (0, d) satisfying (10.11.2). 
Now 
(10.11.38) (47)~* P(s) f(s) = z | ys | A(z) |Pdady. 
Ds Dr 


If (10.11.1) is the 7g which transforms D into Dy, then elementary calcula- 
tions show that ; 


= dr 1 
Y= Ta-br  ldr'|~ [d—6r' 
Waele oh da'dy’ 
dady ar’ da'dy [dr 
Also A(t) = (d—br’)" A(z’). 


Hence, substituting from (10.11.1) into (10.11.3), writing m for —b, for d, 
and then dropping the dashes, we obtain 


yo 
(10.11.4) (41)-8 F'(s) f(s) = py {fh d= br ps | A(r) |?dady 


= [[ ole) P Fee, ded, 
D 
where 


(40.11.5) F(s,7) == : 


and the summation is defined by 


(10.116) OSm<a, -mMm<n<m, (mn)=1, n=lifm=0. 
Finally we multiply both sides of (10.11.4) by 


oY] , ot 
26(28— 22) = 2 em rs x | he [2 


where the dash excludes the value & = 0. It is plain that 


2¢(23 — 22) F(s,7) = £5" laa = Ea £(s), 


where the dash now excludes the pair m = 0,2 =0; and so we obtain 
(10.10.1). 


10.12. We now require certain properties of the function 
(10.12.1) K(w) = £2’ exp(=72|mr+n('), 
where w > 0. The most essential is the functional equation 


(10.12.2) 1+K(w) = n(r+x(5}}. i 
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This is a simple corollary of Poisson’s summation formula for functions of 
two variables, viz. 


(10.12.38) EEG (m,n) = SE | I BE, ) erriomt+nn dE dy, 


where all summations and integrations are from —« to oo. In this case 
1+K(w) = 22 4(m,n) 


with #60) = exp|- "| Ble+iy) +9). 
If we substitute this value of ¢ into (10.12.3), and use the formula 


| | 28? 28 £4 —y E24 Qn imEtng) di dy 
Faq [ayn em mtr, 
where a, y, and ay — f? are positive, we obtain (10.12.2). 
We shall also need an upper bound for K(w) for large w. This is 
(10.12.4) K(w) = O(yetx), 
and holds uniformly for w>J1 and 7 = 2++y in D. 


We have K(w) = 22" exp (- FP fone emp +m'y9) : 


The combination m = 0, = 0 is excluded. Also y= 4/3, since 7 is in D. 
(i) The terms m = 0, n+0 give 
Qe-mVly 


ao ae 
—7?- 
2Be VAIS OE Ee BONE os) Sea 


which is O(y/w) when w/y is small and O(e-*“”) when w/y is large; and so 


of( L+ z) ew = Ofye*lv) 
in any case. 

(ii) There remain the terms for which m+0. If we fix m, then there is 
at most one for which |matn|<. 
This m and n give O(e-""7””), and summation with respect to m then gives 

O(e-72¥) = O(ye-*tmiv) 
The other ” give two sequences whose numerical values exceed 4, 2, 3, ...; 
and so (for the fixed m) we get 
Of(e-trwely 4 e-tmwly 4.) e-merwr}, 
which, as in (i), is Ofye tele, emer} 


7 y>} and y>hy. 
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Finally, summing with respect to m, we get 
Ofye truly i emmy = O(ye tv), 
This completes the proof of (10.12.4). 
If y is fixed, then 
(10.12.5) K(w) = O(e-4¥) 


for large w and a positive A. 


10.13. We can now determine the principal analytical properties of £(s), 
It is defined by (10.10.2) when o> 12. Also 


gy ea) 2 [yet (-= | 
(") [oar pn [Ee ae exp 7 |mr+n |?) dw 


for g>11, and so 


(10.13.1) (“yr T(s—11)&{(s) = [Purseze exp(-— | mr+n r) dw 


= [Pex (w) dw, 


it) 


if the summation under the integral sign is legitimate. This will be so if 
[ wr K (w) dw< oo. 
0 


This integral is convergent at infinity, whatever o, because of (10.12.5). 
Near the origin 1 1fl 
K(w) =- 14242 (5) 

wow \w 
behaves like w-’, and the integral is convergent for ¢ > 12. Thus (10.13.1) 
is true for all such ¢. Also 


e-11 
(10.13.2) (¥) I(s—11) &(s) 
= i WK w) dw+ i so si + + = K (;)} dw 
1 


=| wrt be 
[rw K (w) dw con 


al s 11—s 
=ptle E(w) dew 


1 ao 
= pampeaagt | ot Keo a 


The integral here is an integral function of s, by (10.12.65), for any r of D; 
and the right-hand side is unchanged by the substitution of 23—s for s. 
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Thus &(s) is regular over the whole plane, except for a simple pole at s = 12; 
and 


(10.13.3) (4 re- 11)&(s) = (2)" "702-2 £(23—8), 


10.14. We deduce the properties of f(s) from those of &(s). It follows from 
(10.10.1) and (10.13.2) that 


(10.14.1)  2(477)-8 (8) F(s— 11) (28 — 22) (8) 


ex poll ie y| A(z) i 11) (4) ew] dudy 


qs-il 
(s— 11) (@— 12) fe"! [aedy 


+n | | y | A(r) |2dady ( * wi 22 4 wits) Kw) de, 
D 1 


Now for any real c, and 7 in D, the integral 


es (w) dw 


is majorised, after (10.12.4), by a multiple of 
wo 
y i ure trey dy, 
1 


If c20, we replace the lower limit 1 by 0; ifc <0, we omit the w* and make 
the same change in the limits. In either case the integral is less than 


Ay’, 
where A and y depend only on c.’ Hence the treble integral in (10.14,1) 


is majorised by ; 
An7-2 { y? | A(r) |Pdady, 
D 


where A and ¢ depend only on oc; and this integral is convergent for 
every o, because A(r) = O(e-270) 
for large y. 

It follows that the treble integral is absolutely and uniformly convergent 
throughout any bounded domain of values of s, and represents an integral 
function of s; and that 


2(47)-8 I'(s) I's ~ 11) £(28 — 22) f(s) 


Tyo c+2ife20,y = 2ife<0. 
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is regular for all ¢ except for simple poles at s = 11 and s = 12. Hence f(s) 
is a meromorphic function of s regular except for 
(i) a simple pole at s = 12, with residue 
(4 
(10.14.2) a= 12 ll y?| A(r) |2dady. 


(ii) poles corresponding to the complex zeros of ¢(2s — 22), all lying to the 
left of o = 12. 
Finally, it follows from (10.14.1) that 


(10.14.38) (2n)-8* Is) P(e — 11) (28 — 22) f(s) 


= (2n)%-46 7(23 — 8) (12 —s) (24 — 28) f(23—s), 
Le. that 


(10.14.4) G(s) = $(23 —s), 
where 
(10.14.5) (s) =(20)-* I's) P'(s— 11) (28 — 22) f(s). 


It also follows from these properties of f(s), and a well-known theorem 
of Ikehara, that 


(10.14.6) 71) +79(2) +... +72(n) ~ gyant®, 
This was a new theorem; but Rankin has shown that it is possible to go 
much further by using a theorem of Landau. 

10.15. Landan’s theorem’ is as follows. Suppose that 

(1) 6,20; 


(2) Z(s) = 
is absolutely convergent for o> 1; 


(3) Z(s) is regular all over the plane, except for a simple pole at s = 1, with 
residue f; 


(4) A(s) = O(er't!) 
for large |t| and some y = (oy, 02), in any strip 0,50 <0; 

(5) T(s)f(s+11) Z(s) = 1(1—s)F(12—s) Ze, (An), 
the last series being absolutely convergent for o <0; 

(8) x 2\e,| = Oe 

RSz 

Then 

(10.15.1) C(z)= Y ¢, = Put O(a). 

CnSz 


* The “trivial” zeros of the zeta-function are cancelled by poles of P'(s—11). 
* Particularised for the application. 
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We show that 
(10.15.2) Z(s) = €(2s)f(s + 11) =z 


satisfies these conditions. 


(i) Sinee 3 

(10.15.3) Bis) = Ba, a 
we have 

(10.15.4) ¢,= YS wr uw) 20. 


pan 
(ii) Both series in (10.15.3) are absolutely convergent for o> 1,’ and 
therefore the product series is so. 
(iii) We proved that Z(s) satisfies condition (3) in§ 10.14. The value of f is 
B= 40x, 
(iv) Tt is plain from (10.14.1) that Z(s) satisfies (4). 
(v) The functional equation (10.14.3) may be written as 
I(s)P'(s+1]) Zs) = (27)? F(1 —s) £12 ~s) Z(1 ~s) 
= I'(1—s) P(12—s) 2e,(1674n)5, 


= fn 
~ 472” 


where en 


The series is absolutely convergent for 7 <0. 
(vi) It remains only to verify condition (6), which is 


C(a) = zon = O(2). 


Now X fe 'r*(u) = O(z), 


Bsa 


by (10.8.9) and a partial summation; and, by (10.15.4), 
O@)= F arn) = BNA) + Bw U) + Sw H) +... 
LEQ 


Bisa HShe BSAC 
= O(x) + O(}2) + O(ha)+... = O(a). 


Hence 4(s) satisfies all the conditions (1)-(6), and (10.15.1) is true. 
The formula which we actually use, however, is not (10.15.1) but 


(10.15.5) D(z) => ni¢,, = de, he 4 O(ai?-8). 


NEX 


* The second by (10.8.9). 
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This follows from (10.15.1) by partial summation. In fact, if [x] = 2, we have 


D(x) = 14C(1)+ 3 vA{C(v)— C(v— 1} 


= 20@)—S (v4 2-9 00) 
= {a!1+4 O(a} {Bat O aso 1y0 4. O(v9)} (Av + O(vt)} 
= Po(1 —H4) + O(a!?4) = pfx? + O(2!?-4), 


10.16. It is now easy to prove that 

(1016.1) -72(1) + 72(2) +... +72(n) = gyaunt? + O(n}2-4), 
and to deduce Rankin’s theorem (10.7.10). 

It follows from (10.15.38) that 
(n) __ Z(s—11) = ges n®2u(n) 

ne — &(28— 22) nes ? 
7(n) = Y Hey. Pal), 
kon 


y 7(n) = - eee) 
nSe Ls. 
= 3 Pd, Be = 3, Py (2) 


isat igat 


= E,PHolra la) (a) 


by (10.15.5). The main term here gives 


2 


TP ae sy HL) _ me of 1 Pe eee re 
nae ca Bo IE” Fay t Oe )} = ppoew? + O(a), 


and the error term gives 
O(at*+#) F - = O(n}?-4). 
Isat 

Hence we obtain (10.16.1). 

Finally, replacing n by »—1 in (10.16.1), and subtracting, we find that 

7(n) = O(n") + O(n)?-F) = O(n??-4), 
t(n) = O(n8*), 
which is (10.7.10). 
The sum function T(n) 

10.17. There is another set of problems connected with the “sum 

function” 


(10.17.1) T(n) = YY 1(n) 


nSX 
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of r(n). The dash over the 2 implies, as usual, that when x is an integer the 
last term 7(x) is to be replaced by $7(z). 

t can explain these problems best by stating their analogues for a more 
familiar arithmetical function. If (2) = r,(n) is the number of representa- 
tions of 7 as the sum of two squares, then the sum function 


(10.17.2) Ra) = X' r(n) 
RSZ 
is the number of lattice points in the circle 
w+tesa, 


with the convention that we count } instead of 1 for a lattice point on the 
circumference of the circle. It is familiar that 


(10.17.3) R(x) = m2 + P(x), 
where P(z) = O(@4); 
but the true order of P(x) is still obscure. Sierpinski proved in 1906 that 
P(x) = O(z+), 


and this result has been improved by van der Corput and later writers. In 
the other direction, Landau and I proved that 
P(x) + O(a). 
Another problem is that of the “‘identity’’ for P(x). It was stated by 
Voronoi in 1905 that 


(10.17.4) P(x) = at % ay SF{2n (nee), 


where J, is the Bessel function of order 1. I proved this in 1915, and 
many other proofs have been published since. 

There are similar “order” and “‘identity’’ problems for T(x). They are 
naturally less important, and any order results are bound to be imperfect 
so long as we are uncertain about the true order of 7(7) itself. The best that 
we can prove at present is that 


(10.17.5) T(z) = O(att), 
Ifthe Ramanujan hypothesis is true, then 

(10.17.6) T(x) = O(a? +), 
It is certainly not true that 

(10.17.7) T(x) = o(a**). 


In this case the identity is perhaps more interesting. It is 


(10.17.8) T(e) = 28 3 10), 4(4m(n2. 
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Wilton proved that 
7(m) 


(0.17.9) T,(x) = Flexi Dy (2—n)*7(n) = xStte 3% aris Janse {4er(na)8} 


for «>0, and I proved more recently that this result is still true for a = 0. 


NOTES ON LECTURE X 


§§10.1-3. Ramanujan, in no. 18 ofthe Pagers, considers the function y,(7) defined by 
a{(1 — ata) (1 — x f8le) Je = Dip, (n) 2%, 


where «| 24. When « = 24, y(n) is 7(). 

When « is 1 or 3, ¥,(n) is given by the identities of Euler and Jacobi. When a = 12, 
w,(n) is a multiple of the e,.(n) of §9.3. It is 0 if m is even, and can be defined when 
n is odd by a sum extended over the representations of n as a sum of 4 squares; the 
formula will be found in Glaisher’s paper referred to m the note on § 9.1. 

Ramanujan conjectured that ,(n) is multiplicative in every case, so that 
Zn-"f,(n) has a product expression analogous to (10.3.8); and he deduced formulae 
for #,(n) when @ is 2, 4, 6 and 8, All these conjectures were confirmed in Mordell’s 
paper 2, Mordell says there that the multiplicative proporty of ¥4.(n) had been proved 
before by Glaisher, but this seems to be incorrect: see Glaisher, Quarterly Journal of 
Math. 37 (1906), 36-38. 

Rankin, in an unpublished manuscript, has found elementary proofs of the multi- 
plicative property of y(n) and y,,(%). These are substantially Glaisher’s Q(n) and 
O(n). 

$10.4. The properties discussed in §§10.4-6 were all enunciated by Ramanujan 
in a manuscript ‘‘Properties of p(n) and 7(n)” now in Prof. Watson’s possession (see 
also no, 28 of the Papers). This manuscript has never been published in full, but no. 30 
of the Pagers is substantially an extract from it, and Watson, in his papers 23 and 24, 
has since rewritten and completed a good deal morc. 

The proofs in §10,4 were given by Mordell (1), but stand also in Ramanujan’s 
manuscript: Ramanujan of course assumes the multiplicative property of 7(n). 

§§ 10.5-6. The proofs here are derived partly from my old notes on Ramanujan’s 
manuscript and partly from Watson’s paper 23. Watson works out in detail the proof 
sketched in §10.6. 

If we replace x by g?, then P, Q, R are 


12,0, 29,04 216g,w% 


7m? aie 7” 
in the ordinary notation of elliptic functions; and (10.5.3) is equivalent to 
A= gi 2793. 


See § 9.11. 


§§ 10.7—-8. See Hardy (9); Hecke, Hamburg math. Abhandlungen, 5 (1927), 199-224; 
Kloosterman, ibid. 337-352; Salié, Math. Zeitschrift, 36 (1932), 263-278; Davenport, 
Journal fiir Math. 169 (1933), 158-176; Rankin (2). Davenport does not mention 
t(n) explicitly, but (10.7.9) is, after Kloosterman’s work, an immediate corollary of 
what he proves. 

Everything that has been proved about the order of 7(7) has its analogue for the 
coefficients c, in the expansion 


co 
> Cy e2nniriN 
1 
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of any modular form of ‘Stufe’ N and dimension —k which vanishes at all the rational 
points corresponding to cusps on the real axis in the modular figure. Here ‘vanishing’ 
means “having the limit 0 for approach along a perpendicular to the axis’’, The 
special function g(e?7*7) ig of Stufe 1 and weight — 12. The general ideas of the theory 
ofsuch modular functions were laid down by Hecke, who proves the result for general 
modular forms which corresponds to (10.7.7). 

The proof of (10.7.7) in Hardy (9) is arranged differently, the underlying Farey 
dissection not being mentioned explicitly. 

L show there that there are constants A and B such that 


An® <7%(1) +722) 4...4+72(n) < Bak, 


This result is now superseded by Rankin’s theorems (10.14.6) and (10.16.1). 

810.9. The functional equation (10.9.4) must have been familiar to Ramanujan, 
but I cannot find an explicit statement of it either in the Papers or in the note-books. 
It was first stated in print by Wilton (1), who proves it as a special case of a functional 
equation for 


s 


F(s,p,9) = yilt) g2npniia, 
n 


Wilton (1) proved that #(s) has an infinity of zeros on a = 6, and Rankin (1) that 
it has none on o = 48 or o = 43. 

§§ 10.10-16. The proof is that given by Rankin (2), with a few small simplifications. 
His proof of the corresponding theorems for functions of Stufe N is more complex. 

§10.12. For Poisson’s formula see, for example, Bochner, Vorlesungen tiber Fourier - 
sche Integrale, 33-38 and 203-208, or Titchmarsh, Fourier integrals, 60-68. The case 
here is a simple one. 

§10.14. Ikehara’s theorem, in the form relevant for its application here, runs: 
af Gn 20, and La,n-* is convergent for o> 1 and represents a function regular for ¢21, 
except for a simple pole, with residue C, at s = 1, then 

= aa~ Cau. 
NSE 
The theorem (which has been referred to already in § 2.8) is based entirely on the ideas 
of Wiener. There is a simple proof of the theorem, in a rather more general form, in 
a paper by Bochner in Math. Zeitschrift, 37 (1933), 1-9. 

§10.15. See Landau, @édttinger Nachrichten (1915), 209-248. 

§10.17, The ‘circle’ problem has been referred to already in Lecture V ($5.1 and 
the note on that section). 

My first proof of the identity (10.17.4) occurs in a paper in the Quarterly Journal 
of Math, 48 (1915), 263-283. A number of other proofs have been given since; for 
references see Hardy and Landau, Proc. Royal Soc. (A), 105 (1923), 244-258. Perhaps 
the best proof is that in Landau, Vorlesungen, ii, 221-232. 

Of the theorems about T(x) stated at the ond of the section, (10.17.5) is Rankin’s, 
(10.17.6), (10.17.7), and (10.17.8) my own. For (10.17.6) see Rankin (3), for (10.17.9), 
Wilton (1), and for (10.17.8), Hardy (10). No proofs of (10.17.6) and (10.17.7) have 
been published. My own proof of (10.17.7) depended on the identity (10.9.6). 


Dl, 
DEFINITE INTEGRALS 


11.1. In this lecture I propose to speak about some theorems of 
Ramanujan which have not attracted very much attention, which are, as 
I said in my opening lecture, “inevitably less impressive” than much of 
his work, but which are still very interesting and will repay a careful 
analysis. 

Ramanujan was occupied, during most of the last year before he came 
to England, with general formulae in the theory of definite integrals. He 
then held a research scholarship in Madras, and submitted three quarterly 
reports to the University on the progress of bis researches. It is from these 
that most of the formulae which I quote are taken. I owe my knowledge 
of the contents of these reports to Professor Watson, who included them 
in his copy of Ramanujan’s notebooks; but most of the formulae which 
I quote had been shown to me by Ramanujan himself. 


11.2. The formulae which I take as my text are as follows. 


(A) {°=@0)-26) +2462) de = 


o(-8). 


sin 7 


(B) [Pesh@-F = xi) +a (2)— vf = I T(s)A(—8). 


(C) (0) + 9) + G(2) + 


(0) —a(1) +26 (2)—... 
= [i deraes [Oia tesa 


(D1) [Pp Faay+F Ae)... eosyede 

“A? SA aay Maree 
(D2) Pho-¥ Ft 5A(2)— | singed 

= A -2)y-A(—4) YF + A(—B) YF — 
@ | [aio — Fa) + No) Fp +] de 

= A—1)2(0)—A(— 2) n(1) + 
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] 


(Fl) fF, Flee) G(fx) da = ee 


a2) fey= [Fear rar, a(6)= [° ee)ede, 
then 


(F3) f()gd-8) =>. 
And if 

(F4) = A(z) HF (yat)+ P(—yxi)} de = By), 
then 

(F5) fea Bix) 4G yat) + G(— yxi)} da = 41 A(y). 


I shall say something about each of these formulae in turn, but I must 
begin with some general remarks about Ramanujan’s treatment of all of 
them. He had no real proofs of any of the formulae; and here I am using 
the phrase ‘‘real proof’? not quite in its ordinary sense but in one which 
I must explain. 

It is reasonable to say that we now know, roughly, the conditions for the 
truth of most analytical formulae. We can say, for example, that a formula 
like (A) is true for certain ¢ and certain s, and that the conditions which we 
impose on ¢ and s are ‘‘natural’”’ conditions; they do not intrude merely 
on account of the weakness of our analysis, but are genuine limitations 
corresponding broadly to the facts. Our theorems will not cover all cases 
in which the formula is true, and it may be interesting and profitable to 
do what we can to extend them; but the conditions under which we have 
proved them would become insufficient if widened in any really drastic way. 

A mathematician may have stated a formula and advanced reasons for 
its truth which are inadequate as they stand, in which case he cannot be 
said to have ‘proved’ it. But it often happens that his method, when 
restated and developed by a modern analyst, leads to a proof valid under 
“natural” conditions, and in that case we may fairly say that he has 
“really” proved the theorem. Thus Euler “really” proved large parts of 
the classical analysis, and there are a great many theorems which Ramanujan 
had “really” proved; but he had not “really” proved any of the formulae 
which Ihave quoted. It was impossible that he should have done so because 
the “natural”? conditions involve ideas of which he knew nothing in 1914, 
and which he had hardly absorbed before his death. He had also, as 
Littlewood says, no clear-cut conception of proof: “if a significant piece of 
reasoning occurred somewhere, and the total mixture of evidence and 
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intuition gave him certainty, he looked no further”’. In this case any “real” 
proof was inevitably beyond his grasp, and the “significant pieces of 
reasoning’’ which are indicated in the notebooks and reports, though we 
shall find them curious and interesting, are quite inadequate for the 
occasion. 


Formulae (A) and (B) 

11.3. The first two formulae are the most important, and I shall discuss 
them in more detail than the rest. Ramanujan was particularly fond of 
them, and used them as one of his commonest tools. They are variants of 
one another, (A) becoming (B) when we write 


A(u) 
#4) = Fa say? 


and we may take (A) as the standard form of the formulae, though we shall 
sometimes find (B) more convenient. 
It is easy to give examples of both the truth and the falsity of (A). Thus 


Hu) = 1, OO) = TE 


give the formulae 
o gs-l T oO _ = _ ee ee ay, 
[ pera ~ sin sm’ ie i ac sins7I(1—s) Ts), 


true for 0<s<1 and for s>0 respectively. On the other hand the formula 
is plainly false when 


d(u) = smu, 


the integral then vanishing identicaJly. The formula is an ‘‘interpolation 
formula”, which defines ¢(—s) in terms of (0), #(1),.... If it has been 
proved for all ¢(u) of a certain class, then any $(u) of that class vanishes 
identically if it vanishes for all non-negative integral values of u. There is 
a well-known theorem of Carlson which says that if (i) d(w) is regular, and 


| d(u) |< Cell, 
where A<z, in the right-hand half-plane of complex values of u, and 
(ii) (0) = (1) = ... = 0, then (uw) vanishes identically; and it is natural 


to suppose that (A) should be true for all such ¢(u) and appropriate s. We 
may also expect that the best methods for the discussion of the formula 
will be those with which Carlson and others have familiarised us and whose 
original source is to be found in Mellin. 
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Proof of (A) 
411.4. In what follows I write 
us Ubiw. 


I denote the half-plane v2 —6, where 6>0, by H(6). For the present 


T suppose O<é<l. 
If g(a) is regular, and 
(11.4.1) | B(u) |< CePer anes 


throughout H(6), then I shall say that ¢(u) belongs to the class @(A, P, 6), 
or simply &. We shall be concerned for the most part with functions for 


which 

(11.4.2) A<m. 

Let us suppose now that ¢(w) belongs to #(A, P, 6), with A<7,0<é<1; 
that 0<c<6é; and that 


(11.4.3) O<a<e-?, 
Then a simple application of Cauchy’s theorem gives 
(11.4.4) 
1 ete T 
O(c) = 6(0) —e9(1) +9%9(2)—... =e ed — werd, 


The series is usually divergent if z>e-”, but the integrand is majorised, 
for all positive x, by a multiple of 
etn Altwle~Peg—e, 


and the integral converges uniformly in any interval 0<a,Sx<X; and 
therefore ®(x) is regular, and represented by the integral, for all positive .? 
We can now deduce (A) by the well-known inversion formula of Mellin. 
This is expressed by the equations 
ro) e+to 
f(u) -| F(z) ada, F(x) a f(u)atdu; 
0 


2rrt J e—tco 
and (A) will follow if we can take 


f(u) 


We may justify the use of Mellin’s formula in various ways, appealing, if 
we please, to some established general theorem; but it is more interesting 
to give a direct proof on the lines originally followed by Mellin himself. 


7 
sin ur 


o(—u), F(a) = P(e). 


‘Tf a = re’, then |---| = r-%e9v, and G(x) is regular for |@|<2—.4; but we do 
not use this. 
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Suppose that s = o +7¢ and 
0<a<64, 
and choose ¢, and ¢, so that 


O0<e,<0K<0,<d. 


Then (11.4.4) is true both with c = c, and with c = ¢. 
Now 


1 ; 1 1 sig Gatio aig 
is O(c) ede = sail, x of aa d(—uja du 


ett 1 tt pee 
a o(—u)du ade = i i STEN 
0 


io SiN 7U 271 J e-io SIN TU S—U 


TY SF ey 


since R(s—u) = o—¢, > 0. There is no difficulty in the inversion, the double 
integral being absolutely convergent. Similarly 


[Po e-tae = |e ay x ast dy 


270 J cio SIN TU 


= 1 ae qT H=%) 4, 


2771 J ~to SIN TU S—U 


since now #t(s—u) = 7-—c,<0. Combining these equations we obtain 


0 a+ E60 ato = 
[lovee [eat Bete ala 
0 eres sin7u s—u sin. s7 


by Cauchy’s theorem. 


11.5. We have thus proved (A) for a d(u) of ® and for 0<a0<6, in 
particular for 0<s<6. Itis plain that, if d(w) = O(e4!!), then $(w) belongs 
to ®, with P = A, so that our class of J(w) includes the Carlson class. 
Hence (A) gives incidentally a proof of Carlson’s theorem. 

We may also reduce the second part of the proof to an application of 
Fourier’s theorem. If we put x =e, u =c+ iu, in (11.4.4), it becomes 


1 Tw 
4) = ff — 6 — dy) elt day: 
Pen) Qa shaunieniay ee eee 


and Fourier’s theorem then gives 


aG(—e— to) on tw) . —E(c+-t1} 2. A 
sinm(e+iw) }_a° Bert B(e-) dé =|) aetio-1B (a) das, 


+ Such a reduction of a “Mellin” to a “Fourier” transformation is of course 
always possible, the transformations being formal variants of one another. 
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I add one remark which applies not only here but often later in the 
lecture. I have supposed A <7, in which case all the integrals occurring 
are absolutely convergent, and with a good deal to spare. In many of the 
most interesting examples A will be equal to 7; and the final integral may 
not be absolutely convergent, in which case a more delicate argument, not 
depending simply on majorisation, will be needed. The hypothesis A <7 
is, here and elsewhere, a crude hypothesis; but it is at any rate a ‘‘natural”’ 
one, the results being usually false when A>7; and I shall have no time 
for any more subtle analysis. 


Heuristic deductions of (A) and (B) 


11.6. The formulae (A) and (B) are connected in a very interesting way 
with “Newton’s interpolation formula”’. 
It is well known that 


a co nn, 
eo 5 (— 1) Stan =e <4 9 an 
0 n! 0 n! 


where Adig = Gy—4 1, Ay = My— 2A, +A, 0... 
” Hence, if s>0 and the series 

Oe 
Ale) = A(0)-=>* are 
is convergent for all x, we have 


[cea tae -|° eas age) a” dav; 
0 


and term-by-term integration gives 
eo _ 0 
x (0) 


ye e-tgttnmldas = rea +5 74400) + 49) 4n(o) +... 


Finally 
(UL61)  A(—5) =A) £420) +2 4900) +. 


is Newton’s formula. 

The term-by-term integration can always be justified when the resulting 
series is convergent. We can therefore prove (B) on the assumptions that 
(i) 8 is positive, (ii) A(z) is an integral function, and (iii) Newton’s formula 
holds for A( —s). If (i) and (ii) are satisfied, then (B) is equivalent to Newton’s 
formula. 

There is a, corresponding reduction of (A), based on Euler’s transformation 

y x 


YO aes 


a T+2" 
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It is convenient to suppose that ¢(0) = 0. Then, writing 5, for d(n), we 
have formally 

bt —b,2? + ba? —... = b.y+4b,.y? + A%,.y2 +..., 


n+1 
—1 a 1 n, 
[oe a) oda [2 34%,(22) dz 
= a = P(s+n+1) 
=- Sa, |" Tegan I 8) 24%, T+) 
+1l)(s+2 
“3 {+@+1)4b, Ett) apse. ; 


In this way (A) also may be reduced to Newton’s formula.” 


Ramanujan’s argument 
41.7, Ramanujan supposes, in effect, that 
(1.7.1) A(u) = x(e-™), 
where a is positive and +(z) is regular at the origin. Then 
= (=1)" 


Ate) = a amgteren 
(EU 2 XO), aon 
n=o 7! r=o 1! 
= > x0) 3 (- 1)? ne-arn = > x"(0) en-tenar 
roo T! nao ! rao 7! 
and A(z)as—de = ¥; x'(0) * greene ysl dn 
r=0 r! 0 


Pe) Pr = PE) gle) = Te) M-s) 


for s>0. The inversions can be justified if (i) y(z) is regular for |2| <1, 
and (ii) s is sufficiently small. For then 


is convergent for some Z>1, 
x1 x(0)| Vy. J meron = SLO) VI vera 


is convergent, and 
iP 0 foo] 
yl x ) if ete aya] Jn aN rez! x*(0) | ears 
rt 0 rl 


* (11.6.1) with @(u+ 1) for A(u) and s+1 for s. 
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is convergent for sufficiently small positive s. In these circumstances 
Ramanujan’s analysis is valid. But the condition on A(w), which demands 
that A(n) should be of the form 


Cy +6, e-9" + Coe-22" + , 


for large n, is extremely stringent and, though it may be extended a little, 
excludes practically all of Ramanujan’s examples. 

Itis instructive to consider a case in which an argument like Ramanujan’s 
leads to a demonstrably false result. Suppose that, in (11.7.1), a =—b is 
negative. Then, if we repeat Ramanujan’s calculations, we obtain 


SX) | ne 
(11.7.2) A(x) <2 a, 


and can apparently complete the proof as before. But (11.7.2) is usually 
false, since A(x) is regular at the origin, while the series on the right represents 
a function of « which is regular in the right-hand half-plane and has the 
imaginary axis as a singular line. Thus the assumptions 


x(u) =e = (e>0), b = log2, A(u) = ee?" 


“would lead to (<9 eo = 3 (~ ys ve, 


oMs 


which is obviously false. 


Examples and applications 
11.8. I quote a few of Ramanujan’s special cases. 
(i) If0<s<Min (a, #), then 
A Fy) Is) P(a-s)Fe- 
1 aime = 
[oe Febrn nde = neg Tena 


Here F(a, £,y,x) is the hypergeometric function, defined for 0<a%<1 by 
the usual power-series and for 2 >1 by analytic continuation. 
(i) If0<s<1, then 


[Pane 2-en 3-008 Jjde== (l-s)~. 
sin s7 
(iti) If 0 <q <1, s>0, and 0<a<qs-, then 
[oe (L+age)(1+ag’)...  g,_ _™ ppg) (ag) 
0 (1+2) (1+¢x) (1 age. sin sa“, (1—g™) (1—ag™~*) 


(iv) If 0<s<4, then 


ce (eat __ Is) 
[f= Doreen 4) = Fa 28)" 
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Of these, (i) and (ii) are straightforward corollaries of what we have 
proved. To prove (iii), we use the expansion 


_ _(L+agz)(L+ag?e)..- SF) _ yyn (Lag) (Lag?) ... (L-ag”) 
P(a) (1+) (1+gx)(1+q@s)... 2 z (1—g) (l—g*)... (1g) a 


which is easily deduced from the functional equation 

(l+agx) O(gu) = (1+2) D(x) 
satisfied by B(x). Here 

= (1—agm) (1g) 
ee) = TN gry (dag) 

Finally (iv) is a formula found independently by M. Riesz, and used by him 
to determine a very curious necessary and sufficient condition for the truth 
of the Riemann hypothesis. On this hypothesis, the formula is true for 
O<s< . Analternative form of @(z) is 


LEM) stmt 
Ao ae ee ! ? 


where (m) is the Mébius function. 


11.9. We can use Ramanujan’s formulae to obtain many expansions 
usually derived from Lagrange’s or Burmann’s series. I take two examples 
given by Ramanujan himself. 

(i) A problem familiar in text-books is that of expanding e~** in powers 
of xe’, Ramanujan argues as follows. If y = we and 


ax = n(n) ‘Te 
é | ae ae 


then, by (B), 
F(s)A(-s) = i ye te-8 dy = [Pea + ba) e280) dx 
0 0 


= I(s)a(a—sb)-s-1; 
so that A(n) = a(a+nb)"-1, 
The argument obviously requires that @ and 6 shall be positive.’ 


(ii) It is well known that the roots of trinomial equations can be expanded 
as hypergeometric series. Ramanujan finds an expansion for any power 2° 
of a root of 


aga? +22 = J 
bod — te 
as follows. If f= 3! 1) MA) an, 
0 n!} 
* When the function 2 Sie oe 


does not quite satisfy the conditions of § 11.4 (the “A” being 7). 
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then, by (B), fi A 
= — a lar = ta aa a 1-3 
Is) AC j=], asytda ae ] af a ). 


Calculating the integral, he finds the expansion 


; ptt 2p—9) 12_ r+ 38p—9) (r+ 38p—29) 
a ae nines al 31 


I do not know whether this formula is new, and I have not attempted to 
find conditions under which the analysis can be justified. 


Formula (C) 


11.10. The formula (C) may be regarded as one for the remainder in 
the “Euler-Maclaurin sum formula”. Since 


= dx ody 1 
» x{7?+(loga)} Jiom+y® ” 


we may multiply 6(0), in the two places where it occurs, by any constant 
-factor. It is convenient to write the formula as 


1 @ $6(0) —aG(1) + 2°A(2)—... 
110) +801)+6(2)+...— | “playde = |” On 


or, replacing @(u) by y"p(u), as 


f°? Aakv) », 
(11.10.21) Rly) -[. Bee oss Aan 
where now 
(11.10.2) G,(z) = $6(0) + 5 d(m)2%," 
I 
and 
(1.10.3) Ry) = Oxy) — | “yro(@) ae 
The most familiar formulae for R(y) are Poisson's, viz. 
{11,10.4) Rly) =2 3 ["v9e@) cos 2nrada, 
1Jo 
aud Plana’s, viz. j 
(1.10.8) Ry) = if” yo a =) ao, 
i = 


Ramanujan’s is apparently new. 


G(x) = O(—x)—$4(0), in the notation of §11.4. 
13-2 
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Itis convenient to begin by proving a special case of the formula. Suppose 


that 1 

0) = Tey 
and Take i " F°G(2) de = | Sees 
Then [re es Fy \dy = ( F rol, evyr dy 


ifs>1. It follows from the inversion formula of Laplace that 
1 ctio ys 
Jy) =s5 ——— ds, 
(y) 27% ie slogs i 
where c>1. If we deform the contour into a lacet round the negative real 
axis, and allow for the residue at the pole s = 1, we obtain 


Es} e-va 
tw =° ardor 


11.11. Ramanujan gives a very ingenious proof of this special formula, 
He proves, more generally, that 


(11.11.1) 
ee a —l pyr ras in ng de =. 
er aa +{° gle (cosmg log x s) ea (one ev 


if y20 and £20. Ef we denote the left-hand side of (11.11.1) by p(y, é), 
and differentiate with respect to €, we find that 


Op _ eves ES same roe 
3 ~TU-8 Sa [“abtereede 


rt 1 LE) sin7€) _ 
a (pglg TE mS 


Hence p(y, ) is a function P(y) of y only. But if we differentiate with respect 
to y, we obtain 


dP © Yd n7é dx 
=—= above = 
dy J ~.T(@) ae |" (co0ne—= me *); ? + (log x)? 


=|" ; roan -{° we cos nt—* TE og *) aictegah 
= ply, €+1) = Py). 

Hence P(y) = Ce’, and we can find C by supposing that y = 0 and € = 07 
* We can also prove (I1.11.1) by the “Laplace transform” method. 
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We may now replace ¢(u) by 
h(a) = 99) 
F(1+u) 


in the general formula, so that we may suppose ¢(0) = 0. We may also 
suppose that J(u) is real for real u. These reductions are not essential, but 


they simplify our forma! analysis.* 


11.12. We can deduce Ramanujan’s formula from Plana’s as follows. 
We suppose again that ¢(u) belongs to &(4, P,d). Then 


to 


an-¥g(— 


Ed 1 
(11.12.1) cei dinamo | 


1 fe) 
= ge : ‘ 
2a i Sin mw” Ae) ae 


we can now take c = 0 because ¢(0) = 0. Hence 
© @,(—y2) 1 f BS Te ee ee i © gate 
i a{n? + (log x) ieee 2n | _osinmiw” Perea o x{7? + (log 2) aoe 


% foe} gabe d o g-iwt a ig 
(a ee = —— di = e-7lv 
. ae I, x{7 + (log x)*} - ie m+ ee 


° @,(—y2) _ sf ee. , 
and so i an? + (ogay} so me tod( — iw) dw, 


which reduces to Plana’s integral (11.10.5) by trivial transformations. 
There is a full discussion of Plana’s formula in Lindeldf’s Calcul des 
résidus. Lindel6f proves (11.10.5) under the conditions that 


(i) e-rlw. PHU bly + iw) > 0 


when | w|-~>oo, uniformly in any finite strip ~dsvSV, and 
(i) [7 etry | (e+ i)|dtn->0 


when v—oo, It is plain that these conditions are satisfied if ¢(u) belongs 
to &(A, P, 9d), with A < 27, and 

O<y<eP; 
and in particular, for such y, by the ¢(w) considered here. Thus Ramanujan’s 
formula (C) holds, in the form (11.10.1), and for these y, for the same ¢(u) 
for which we proved (A). 


* We could begin by verifying the formula for any special ¢(u) for which 4(0)+0, 
but I know of none for which (C) becomes quite trivial. 
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11.13. Tadd a remark about Plana’s formula. It may be proved directly 
as a corollary of Cauchy’s theorem: but it is worth while noticing how it 
can be deduced from Poisson’s formula, which, being a ‘‘real variable” 
formula, is probably now more familiar. Let us write Poisson’s formula 


(11.10.4) as co 
Rly) = 2 x Ji3 


write e~#, where f> P, for y; and v for x. Then 


—_ a I “90 en(P-2nnbe av} a mii | ai) ev Piw—anmw ae}, 


again by Cauchy’s theorem; the argument here demands only that A < 27." 
When we sum, we obtain Plana’s formula. 


11.14, Suppose, for example, that 
_ +4) 
00) = Foy Fea’ 
where 7 >0. Then (uz) belongs to & for 0<d<r, any positive A, and any 
P; and (11.10.1) gives 


(r-+2) eo (l+yr)* 
Gey (aa r(1+2) rae = | + (log) “* 


The integral on the left is elementary when r is an integer, and then that 
on the right can be evaluated in terms of elementary functions. 


Fourier transformations 
11.15. The formulae (D) embody a heuristic theory of Fourier transforms, 
a theory which is naturally valid only under very restrictive conditions. 
Suppose, for example, that A(u) is an integral function, and that 


(11.15.1) A(1) = A(3) = A(B) =... = 0, 
so that 
S(-DPA(r) SA) on 
(11.15.2) A(z) z= a x x al xo 


is even; and let 
(11.15.38) Ay(u) = | (2) 70+ 008 jun x —u— 1) 


Then A,(z) is also integral, since the poles of [(1+4) at —1, —3, ... are 
cancelled by zeros of cos fu, and those at —2, ~4,... by zeros of A(—u—1); 
and 


(11.15.4) A,(1) = Az(3) = AY(5) =... = 0. 
* Instead of 4 <7. 
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Also A,(2m) = (uy /() 2m!A(—2m—1) 


d 
an A= 2m=1) = [{=) a(em) tim {71 —2M +e) 005 HOM + 1—e)n) 


“on yee 


Hence, if we write 
ao 


M(x) = /(Z) 3 (— ye A(—2m— 1) 2% 


0 
and denote by A,, M, the functions formed from A, as A, M are formed 
a A,(a) = M(2), Myla) = A(a); 
and (D1) and the corresponding formula with A, become 


J@{re cos yxdz = M(y), {GQ [ine cos yedx = Aly). 


These are the ordinary formulae of the Fourier cosine transformation. 
If we suppose, instead of (11.15.1), that 


A(0) = A(2) = A(4) =... = 0, 
and define A,(w) by 


= J) P+) sin oor —w— 1), 


we arc led similarly to the formulae of the sine transformation, 


= 2B al = 2-7 at (-44) 
ay Me) = Fay Oy 
satisfies (11.15.1). In this case 
5 9 2-H} + 4u) Oru aft 
Ayu) = JG) I(1+ 4) cos hus. TT a) = Tae = 


” 


A(u) 


and A(ee) = M(w) = 3S (gazym = ea 
0 


The formulae express the ‘ eee property of e—*. 

On the other hand there are many familiar reciprocities which cannot be 
expressed in this way. Thus 

1 © COS Y% 
l+y? Jo 14a? 
‘The first of these is (when | y|<1) a case of (D1), with A(u) =1; but then 


Ay(u) = J) P(1+4) cos dur 


a 
(11.15.5) [er cosyerds = dx = tne-", 
0 


SET L122 
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is not an integral function, so that we cannot account for the second formula 
in this way (as is obvious because e~!# is not expansible as a power series in y). 


11.16. I worked out a theory of the formulae (D) in my paper 11. I shall 
state it here in a rather more general form due to Mr F. M. Goodspeed; 
his theory accounts for the formulae (11.15.5). 

Suppose that y(u) is an integral function and that 


x(%) 
(12.16.1) BTA o + Ho) 


where A <7, for all u. Then it follows from Cauchy’s theorem that 


= «o (—1)" x(n) oie 1 ie T xy(u) 
ee ee Userdn+h)* 27rt J oto SIN 7% Put ye" 


< CePiitalwl, 


when ~l<c<0, 0<a<e-?, 


the series being then convergent. The integral is uniformly convergent in 
any interval 0<d<#<A<oo, so that A(z) is regular for all positive x. 
Tf now y> 0 then 


(11.16.3) JA A(x) cos yada 
WS 0 
a 2 oo) 1 etic a aty(u) 
= JG i epsynde e-te Sinmu 2I(tu+4) ay 


2\ 1 petio a Cu) | rae 
~~ flan Joon aa AH |, oes 
The justification of the inversion is not quite trivial but proceeds on the 
same lines as the discussion of the same point in 11. 


Now { x cos yada = I(u+1) cos hut lyry*. 
0 


If we substitute this value in (11.16.3), and simplify by means of the 
duplication formula for the Gamma-function, we obtain 


2 oO = 1 e+to T pune ORY) 
MO ete Gone a= mle sinmi P= du) a 


__ 1) pee _ a y'x(-u-1) 
Ott) ote Sin Te T(E + du) 


(with a different c, but still with —1<c<0). This integral may again be 
calculated by Cauchy’s theorem, with the result 


(11.16.4) Jee cosyxdx = M(y), 
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where 
(= 1)"%x(-2-1)_, 


(11.16.5) M(z) = DoT (ingd 


This series also is convergent for 0<a<e-”, and M(x) is regular for all 
positive x. And it is plain from the symmetry of the argument that 


(11.16.6) 2 (=) { Cc) coayade = Aly). 


11.17. Ramanujan’s own formulae for cosine transforms are slightly less 


general. If we write 
xu) A) 
Turd) Tu+i) 


2-1 In 


or xu) = Tqu+1)™ 


and suppose that A(w) satisfies the conditions (11.15.1), then it will be 
found that the formulae of § 11.16 reduce to those of §11.15. The “order” 
conditions are equivalent. The analysis of §11.16 has the advantage of 
symmetry, and is more general in two respects. In the first place, the 
assumption that y(w) is integral is less exacting than the assumption that 
A(u) is integral, since it allows poles of A(u) for u = ~—2, —4,..., A more 
important point is that, in §11.15, both A(z) and M(x) are even analytic 
functions regular at the origin, while in $11.16 they are defined by the 
power series for positive x only, and then for negative « by evenness.” 


i _ 2eT(qu+3) 
Thus, if we take y(u) = Tut)’ 
then A(z) =e, M(x) = A Fee 


for x >0. The conditions of § 11.16 are satisfied, and, if we define A(x) and 
M(x) by evenness for <0, we obtain the formulae (11.15.5). 


Hf we take xu) =1 


we find that A(z) = M(z) = Samra” 


t In (D1) we are in an intermediate position. If we write the formula as 


Jira cosyxdz = M(y), 
Tt} 0 


then A(z) is defined like the A(x) of $11.16, but M(y) is regular at the origin. In 
§11.15 we make the relation symmetrical by specialisation. 
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for #>0, and the same remarks apply. This A(x), which may also be 
expressed by = es 
_ ne [ eH dt -{ eh —2v dw, 
x 0 
is one of the examples of “‘self-reciprocal” functions given by Hardy and 
Titchmarsh. 
Finally, the choice 


4 m2 
xe) = FE) PO) 
leads to A(z) = M(x) = e-#*, 


This case is covered by §11.15. 


11.18. I said in my introductory lecture that Ramanujan was familiar 
with most of the formal ideas which underlie the recent work of Watson, 
and of Titchmarsh and myself, on “‘Fouricr kernels” and “‘self-reciprocal 
functions’’. Here we have a case in point. 

It is plain that a necessary and sufficient condition for the A(a) of § 11.16 
to be self-reciprocal is that 


(11.18.1) x(u) = x(-u=1). 
If we write 

(1.18.2) Plu) = LG + uw) 0 — du) x(—), 
then (11.18.1) becomes 

(11.18.38) wu) = w(l—u). 


The integral expression for A(z) is 


Aa 


ri | pio Sin ru 2 F(4u + 3) 


If we change u into —u, substitute for x(—w) from (11.18.2), and perform 
some simple reductions, we obtain 


c+ ico 
(1.18.4) Ale) = = { ©" aT (bu) oyu) du 


which is (apart from a numerical factor) the formula for self-reciprocal 
functions given by Titchmarsh and myself. 

All this has its analogue in the theory of “general transforms”’, but 
I postpone this to the end of the lecture. 


Formula (E) 
11.19. We may write (E) as 


(1.19.1) [Pac Meyae - B (= 1)8A(—n= 1) 4), 
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A(a) and M(z) being the two infinite series which occur in the integrand 
of (E). We may as well consider a more general formula, viz. 


P(s+n) 
ni 


(11.19.2) [ae Meyertae x lyn ee ics 


= M%e){A—8) (0) FA —8 1) (1) +E a( 9-2) (2) — 


We can deduce (11.19.2) formally from (B) by writing 
[4@ M(x) ada = BF any ["Atey gnts—l dy 
0 0 wm! 0 


= 5 ¢— 1222) (sn) ala), 
0 n! 

and there are some few cases in which this procedure can be justified. They 

are however extremely special; and Ramanujan seems to have had no 

other argument, even when M(x) is cosyx, when proof on these lines is 

quite impossible. In this case 


(2m) = (—1yey”,  w(2m+1) = 0, 
and (11.19.1) reduces to (D1). 


The formula becomes simpler when expressed in the notation of § 11.4." 


Then 
_ Aw) _ mu) 
4) =Tasay YO“ Pasa 


(x) is defined as in § 11.4 and V(x) pee and (11.19.2) takes the form 


E H(—n—s) Hn). 


(11.19.3) [ “O(e) Maye de = ad 


This formula is simpler, but rather misleading, and (11.19.2) is really the 
better standard form. We can see this by considering the most obvious 
special cases. 

If we take Xu) =a, plu) = BY, 
where 0<#<za, in (11.19.2), we obtain 

P(s) 

-2,92 = 

BES eo it 
which is true for all positive s. But if we take ¢(u) = a”, (uw) = BY in 
(11.19.3), we obtain 


* xs1dz 7 ais 
= — —s-1 —s—2 ey _ —~_ 
J (1+ax) (1+ px) a? ce a de dhs sin sm a— 


which is always false. 


ae" 1) ao 


* co 
| etePeas1de — I(s)\a as =F ap 4 
0 
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The correct formula is 


ae as-ldx T ais — pis 
f (l+ax)(1+fx) sinst a—f 


which is true for 0<s <2, We can write the right-hand side as 


ai a (a8 + asR a oe 8262 AES alfi-s a apis Bites ) 


{6(-s) ¥(0) + 4(—s—1)¥(1) +... -d(-1) W(l—-s) 
—$(—2) ¥(2—s8)—...}. 


Here there are two series instead of one, and in fact, as we shall see in a 


moment, the “right” formula for the general integral is not (11.19.3) but 
(11.19.4) 


= sin 87 


[Poe veya tae = {3 6(—n-2) Hm) -E 6(—n— n+ 1-9]. 


sin s7 
This reduces to (11.19.3) when ¢(u) vanishes for « = —1, —2,..., as when 
au 
p(w) = Td +4) 


It is therefore better to work with (11.19.2). I shall not do more than 
indicate the connection of this formula with the standard formulae of the 
theory of Mellin transforms. There is a rigorous investigation in an 
unpublished paper of Goodspeed. 


11.20. We may regard (11.19.2) as a form of the “‘Parseval” theorem 
for Mellin transforms. This theorem asserts that, if 


(11.20.1) f(z) =| F(z) ede, 9(s) = | eteyarrae, 
0 0 
then, under certain conditions, 


co) k+to 
(11.20.2) ie F(x) Gteyerde = | f(e)g(s—%) du. 


Now in certain circumstances, which we considered in detail in § 11.4, 
we have 


(1.20.8) 
I * Ate) td {Tae i r M(x)aelde = I's) u(—8), 
(11.20.4) 


Agee! i ’TM)A(—u)zdu, M(x) = i Pa) p( uate 
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In fact the formulae (11.20.3) are cases of (B), and if we write 
_ Bm 


_ __Atu) 
d{u) = TU +uy yu) a Fil+u)’ 
then the formulae (11.20.4) become cases of (11.4.4). 
Taking F(x) = A(x), G(x) = M(a) in (11.20.1), (11.20.2) gives 
} k+io 


[Fe G(x)asdze = = Tu) P(s—u) A(— 4) w(u—s) du. 
0 


27% k-ico 
If this is true for a & between 0 and g, then the last integral, when calculated 
by residues on the right, is 
io) ( aa 1)" 
DISET s +n) Al—s—n) nln), 
0 nN! 
and we obtain (11.19.2). The whole argument, of course, requires careful 


consideration. 
If we argue with ¢ and y instead of with A and yu, we have 


. 


i "O(a)etde = 5 $(-8), i 4 Pu) a Ade = —"_ y(—), 
Pad _ 1 K+io a T 
I, Reha) e ee Qt ieee sin ur sin(s— wu) 7 Po cue ine ld, 


and when we calculate this integral by residues we are led to (11.19.4). It 
is only when ¢(u) vanishes for negative integral values of u that our earlier 
formula (11.19.3) is correct. 


11.21. Formula (11.19.2) is a very powerful tool for the evaluation of 
definite integrals. If for example 
1 I 
See Pe (Ty att ee ey V0 
Fae ceay ? ? 03) L(b+1+u) (38) ? 
then Ala) = 24a-9q-4a J, (a. s/c), M(x) = 226-%a- PU, (8 fee). 
The formula reduces, after some trivial transformations," to 


: im ee I(s-+4a+ 3) 
i; Laake 2a rents 4) 


Au) = 


2 
<2 B( e+ 4a+-46, 48 fo+ 40,5 + i: ‘). 


which is true whenever 0< #<« and the integral is convergent.” 


* 9 for x, }(st+a+b) for s. 
1 Le if -a—b<s<2. 
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Formulae (F) 


11.22. The last group of formulae contain a further justification of the 
assertion of my own which 1 quoted at the beginning of §11.18. I shall 
regard them, as did Ramanujan, simply as formulae, and say nothing at 
all here about conditions for their validity. Suppose that, in our previous 


as Alu) = atp(us), le) = Bg), 
where 0<#<«a and cee 
Then A(x) = F(ax), M(x) = G(fx), 
where Fie) = 3 s(n) z, Gla) = 3I— anyar, 
Oo wm: Ms 
and (11.19.2) gives 
fr 1 


(11.22.1) [or Flos ) O(pe)de = 3-1)" Sa =o 


For example, a solution of this integral equation is 
( poe 1 y en Z 
nt cosh {eJ(n+1)}° 


F(x) = scr cos(c nx", G(x) = 2 
Now 


| “ada i ‘ Plax) O(Bx) de = i E G(r) de * FR aunyelde | 
me if. e*G(x) da = ff (8) 9(1—8), 


where f(s) and g(s) are defined by (F 2). Since the first integral here is 
ie of Bo, 


0 eG ad ~ sinan 


we obtain (F3). 
I now recall some fundamental formulae in the theory of “Fourier 
kernels”. If K(x) is a Fourier kernel, that is to say if 


(1.22.2) | Ate) Ray) de = By) 
implies 
(11.22.3) [ B(x) K(xy) da = A(y) 


for “arbitrary” A(a), and 
= [ Keye-tde, 
then : 
(11.22.4) k(s)k(1—s) =1. 
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More generally, if (11.22.2) implies 


(1.22.5) i B(x) A(xy) da = Aly), 


and h(s) is defined like k(s), then 
(11.22.6) k(s)R(1—s) = 1, 


Ramanujan never, I think, writes down quite these equations, but (F4} 
and (F6) are formally nruch the same. For if 


Rae Tea (Fai) + P(-2i)}, Ha) = oa {Q(vi) + G( —xi)}, 
then k(s) = zeal}, Ff (at) a®-tda + i F(~ai) 23-1 aa| 
= 765 {t-§ + (—4)-3 f(s) = /@) cos $s7 f(s); 
ee 


so that (11.22.6) becomes (F'3). 

All this was naturally simple formalism in Ramanujan’s bands. Its 
translation into substantial analysis demands idcas which were quite beyond 
his ken, Plancherel’s theory and Watson’s generalisation. But a formal 
basis is essential as a foundation for all such theories, and that Ramanujan 
possessed. 


11.23. I conclude with a few remarks about the topic which I postponed 
in § 11.18, the extension to “general transforms” of formulae (D). 
Suppose that 
k(s) 


(1.23.1) ks) = aps 


so that (11.22.4) is satisfied identically. Then 
= 2m e—to 


K(x) : [Or etay udu 


is a Fourier kernel whose Mellin transform is k(s). 
Tf we start from the formula 


=¥ nm), pete r(—4) __, 
Rie) = Z(-1) K(n+ 1)” ~sil sinum K(1—w)~ aes 
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and argue on the formal lines of § 11.16, we obtain 


ctia 
[x R(w) K(ay) dx =a ulema wsdu] K( K (ay) ad 


270 J ein SiN UT Rea) 


aes am #r— ok aah u)y"tdu 


Oni cto Sin um K(1—4u) 
1 petite on (4) 

- al. sinum K(u) es 

es aoe) a. 

Fa eyo? 

so that 
(1.28.2) 
Beefy Red cae =e (— 1 e* 2D an 
Re) = D(- ee Se) = E(-1P 


are a pair of ‘‘K-transforms”’ satisfying 
(11.23.3) 


[PR Keanade = Se), | 8(@) Keep)de = RY) 
The condition for a “‘self-reciprocal” R(x) is that 
r(u) = 7(-u—1). 
Thus if K(e) = fz J,(2), 


= 9e-4 
then k(s) 8 Taq a= aa! 
and we may take k(s) = 24P(4s +4404 4). 
Th Riz) = &(-1y 4 ot 
“ OO TapeD 
d Ss _< —1)\r *(—n—-1) —inpn 
= @=2(-I rape 
are a pair of Hankel transforms. 
2 1 
os ey 71a? 
then k(s) = cot4sz, 
and we may take k(s) = cosec hz. 
In this case 
R(x) = X(- 1)*r(2n)2", Sa) =F (-1)?r(—2n— 1) a; 
0 
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the integrals (11.23.3) are principal values. If r(u) = 1, then 


1 
Re) = Sia) = Tae’ 
and other simple rational functions self-reciprocal for K(x) may be obtained 
by making #(w) an appropriate polynomial. Here also I must restrict 
myself to 2 mere sketch of the formal lines of the theory, adding only that 
all the analysis of §§ 11.22-23 has also been made exact by Goodspeed. 


NOTES ON LECTURE X31 


$11.2. The quotation from Littlewood oecurs mn his review of the Papers. 

811.3. Carlson’s theorem is proved in Titchmarsh, Theory of functions, 185-186. 

The theorem has been generalised in various ways. In the first place, we ean weaken 
the order conditions on 4(u); and secondly wo may replace the conditions 


$0) = PL) =... = 0 
by more general conditions of the type #(A,) = 0 (n=0,1,2,...). The most general 


theorem of this kind which I have scen was communicated to me by Mr N. Levinson, 
and runs: if (i) 6(2) is regular and O(e4!!), for some A, in the right-hand half-plane; 


(it) i os loge loty bien =a ie a eee 
(iti) PAn) = 9, 


where A,, is an increasing sequence of positive numbers such that 


1 
A, <Bn, YX —>logR-C, 


for some B and C; then d(u) = 0. 
§11.4. For Mellin’s formula see Titchmarsh, Fourier integrals, 7-9 and 46-48. 
The proof of (A) is that given by Hardy, Acta Math. 42 (1920), 327-339. The con- 
ditions on ¢(u) may be relaxed here also, for example to 
Plu) = O(e4!) (some A), P(iw) = O(| w|-Fe™), 
Goodspecd, in an unpublished manuscript, has proved a ‘A,,’ generalisation of (A). 


EEE, 0<Ay<Anuy O<ANSA,S Bn, 
iad cee 
u)= ~aa}, 
a= fi.-¥) 
and f (u) belongs to §(C, D, 6), where C<7/B and 6>0, for some D, then the series 
2 FA) 
nmi 9 (An) 


converges, when the terms are grouped appropriately, for sufficiently small x, and repre- 
senis an analytic function H(x) regular for all positive x; and 
‘oO — 
f eH (2) de = f(=8) 
Q gs) 
for 0<8<0. If also Agi ~AgZA>O, then the series converges in the ordinary sense. 


HR T4 
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§11.6. The connection of (B) with Newton’s interpolation formula was pointed 
out by Narayana Aiyar (2). 

For the theory of Newton's formula see Nérlund, Vorlesungen wiber Differenzen- 
rechnung, 222 et seg. There is much useful information, but no function-theory, in 
Whittaker and Robinson’s Calculus of observations. 

For the justification of the term-by-term integration sce Hardy, Trans. Camb. 
Phil. Soc. 21 (1912), 1-48 (5-6), and Messenger of Math. 39 (1910), 136-139. There 
is a similar theorem relevant to the reduction of (A) at the end of the section. 


§11.7. Ramanujan’s analysis may be extended, for example, to cases in which 
xz) = Zc,2?r, 


where p, > 00, the p, need not be integers, and a finite number of them may be negative. 
This extension covers the case A(u) = e*“, for any real «. A case actually covered by 
his analysis as it stands is 


§11.8. There is a direct proof of formula (ii) by Cauchy’s theorem in Hardy (5). 
For formula (iv) see M. Riesz, Acta Math. 40 (1916), 185-190, and Hardy and Littlo- 
wood, ibid. 41 (1917), 119-196 (156-162). 


§11.9. For Lagrange’s and Burmann’s series see, for example, Whittaker and 
Watson, Modern analysis, ed. 4 (1927), 128-133. 

The most complete results concerning the expansion of roots of trinomial equations 
are those of Birkeland, Math. Zeitschrift, 26 (1927), 566-578. Birkeland gives full 
references to earlier work of Mellin and other writers. 


§§11.10-13. The substance of the proof is taken from Hardy (3), but Ramanujan’s 
argument in §11.11 has not been printed before. 

Laplace’s inversion formula, used in §11.10, is another formal variant of Fourier’s. 
See Titchmarsh, Fourier integrals, 6-7 and 48-49. 

For a direct proof of Plana’s formula by mcans of Cauchy’s theorem see Lindeléf, 
Le calcul des résidus, 55-62, 


§§ 11.15-18. See Hardy (11) and Goodspeed (1). The penultimate oxample of § 11.17 
occurs in Hardy and Titchmarsh, Quarterly Journal of Math. (Oxford), 1 (1930), 
196-231 (210); and (11.18.4) on p. 198 of the same paper. 


§§11.19-20. It is curious that Ramanujan seems never to have written down 
(11.19.2), or even (11.19.1) in its genoral form. In the notebooks and reports one of 
the functions is always even. 

For the Parseval formula for Mellin transforms see Titchmarsh, Fourier integrals, 
94-95, There s = o+it, 0<k<o, and a*F and a2¢-'G aro L%(0,00). The integrals 
(11.20.1) are ‘mean square’ integrals, and (11.20.2) is true, in the ordinary sense, for 
all s whose real part is o. 

Goodspeed, using a method more like that of §11.4, has proved that (11.19.4) zs 
true whenever 


(1) Blu) ts an integral function, and belongs to R(A, B, 1), where A<m, for some B 
and any positive 7; 


(2) yr(u) belongs to &(C,D, 8), where C <7, for some D and a & not less than 1; 
(3) O<s<2; 
(4) b(—u—s) p(w) = ofe#luly, 


where H< 27, uniformly in w, when v > 00. 
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It should be observed that ¢(u) occurs in the theerem with arguments n, —n—s, 
and —n—land ¢(u) with arguments n and n+ 1—s. The cenditiens that $(u) sheuld 
be integral, and y(w) regular for o> —1, are therefere natural. 

811.21. See the remarks at the end of §11.5. In this case each ef A(u) and p(x) 
behaves reughly like e!7!, and each of ¢(u) and ¥(u) roughly like e7!”, in the direction 
of the imaginary axis, and the final integral is net necessarily abselutely convergent. 
Tt is net difficult te justify the transfermatiens in this or any other particular case. 

The integral was first evaluated by Weber and Schafheitli: see Watson, Bessel 
functions, 398-410. 

§11.22. Fer the theery ef ‘Fourier kernels’ and ‘general transforms’ see Hardy 
and Titchmarsh, Proc. London Math. Soc. (2), 35 (1933), 116-155; Watsen, ¢bid. 
156-199; er Titchmarsh, Fourier integrals, ch. vir. 

The subject has attracted a geed deal of attentien since the appearance of Watsen’s 
paper. Full references, up te 1937, will be feund in Titchmarsh’s beok. 


$11.23. Geodspeed has preved that if x(u) ts real on the real axis, and is the reciprocal 
of a function regular for v= — 0d, where 0 ts positive; r(u) 1s an integral function; and 
r(u) r( —l-4) 
K+u)’ «(1tu) 
belong to the classes R(A, B, 3) and R(C, D, 0), with A<m and C <n, then k(s) defines a 
Watson transformation in which Rix) and S(x) are reciprocal. 
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XI 
ELLIPTIC AND MODULAR FUNCTIONS 


12.1. I must end by saying something about Ramanujan’s work on 
elliptic and modular functions, and it is this which I find my most difficult 
task. It is here that both the profundity and the limitations of Ramanujan’s 
lmowledge stand out most sharply, and that it is least possible to decide 
how much he may have learnt from others. Besides, I do not know the 
subject very well. 

Ramanujan never professed to have made any major advance ‘1 the 
general theory of elliptic functions, and it seems that he must have learnt 
the fundamentals of the theory, so far as he was interested in them, from 
books. There is a sharp contrast between his attitude here and his attitude 
about the theory of primes, where he certainly regarded all his results as 
his own, He never writes as if he had invented theta-functions or modular 
equations, though he sets out a whole theory of them in a language . 
of his own. Cayley’s and Greenhill’s books were in the Madras University 
Library, and he could have found a good deal about them there. Indeed 
Littlewood says that “Ramanujan somehow acquired an effectively 
complete knowledge of the formal side of the theory of elliptic functions”, 
and that his ignorance of complex function theory aud of Cauchy's theorem 
may seem difficult to reconcile with this; and adds that “a sufficient, and 
I think necessary, explanation would be that Greenhill’s very odd and 
individual Eiliptic functions was his text-book”. In Greenhill’s book the 
complex variable and double periodicity are not mentioned until p. 254, 
and the double periodicity is deduced somehow from properties of Cartesian. 
ovals. In fact Greenhill knew very little more ‘function theory” than 
Ramanujan. 

I said that I did not know the subject well, and I shall rely very largely, 
in all that follows, upon Professor Watson. In the first place, his lecture 
to the London Mathematical Society contains a short account of the 
relevant chapters of the notebooks. Secondly, he has very kindly lent me 
a manuscript containing proofs of nearly all Ramanujan’s formulae, 
without which I should have been very much lost. 


12.2, The relevant chapters are Chs. xvI-xx1; these, in Watson’s opinion, 
“show Ramanujan at his best”’. In Ch. xvz he starts with the function 


H(a,b) = (1+a) (1+ab) (1+ab2)..., 
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and all his later work is based on it. He develops a whole series of theorems, 
mostly to be found in Euler, Gauss, Jacobi, Heine, or other writers, though 
there are some which seem to be new.’ I shall not refer to these formulae 
except when it is necessary for my immediate purpose, which is to track 
down in the notebooks the proof of the modular equation of degree 3. 
Ramanujan also writes 
f{(a,6) = 14 (a@+6)+ab(a?+ 5?) + a269(a? + 59) + 


(the indices of powers of ab being the triangular numbers). Thus, in the 


orthodox notation? a5 b) = 9,(2, ze 
uae -4 logs T= sy log ab; 
or again i (a,b) = pl2,9); 


cy + 
where p(2,g) = ghz", z= etm = (; \\) gq = ert = (aby. 
Ramanujan gives Jacobi’s fundamental factorisation formula in the form 
f(a, 6) = (a, ab) 11(b, ab) T(— ab, ab). 
Finally, he writes3 


09) =f(GQ = 1+2¢+2¢'+ 

HQ) =fGP) =lt+qt Prt... 
f-M=f(-@ -@) =1-¢-P ++ 

MQ) = 71¢.¢) = 1+) +9%) (149°)... 


Thus A(q) = (0,7). 
Further, if, with Tannery and Molk, we define gp, q1,¢2 and 93 by 


% = lived) hh = I(.+9"), Ge = H+"), Ws = Tieg): 


then X(2) = Qs 
f(-9) = (1-q) (1-9?) (1-9). = GG 
(1g?) (1-94) (1-9). _ Yo 
WO) = Tg =a (=a) 
The last pair of formulae embody famous identities of Euler and Gauss. 
We have also v(@?) = 49*8,(0, 7) 


(a formula which we shall want later). 


* Especially 16.17 (in the numeration of the notebooks), which I shall refer to again 
in § 12.12, 
* That of Tannery and Molk. 3 I replace his « by the customary q. 
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12.3. In this notation many fundamental formulae appear in odd 
disguises. Thus the “inversion theorem” (there is a 7 corresponding to a 
given k?) appears as follows. Writing c for i?, defining K and K’ as definite 
integrals or hypergeometric series, and putting 


e~TK'E = H(c), 


then the equation q= A(c) 
4(— 
has the solution c=l oo 
Again the theorem snbu+enu = 1 


appears in the form: if ¢ = e~¥ and 


sin}d —-sin30 _ 00s$6 | cos 3 
Swe. sinh dy? sinh by OT) = cosh dy ° cosh $y ° 
then S®4 C0? = hPpt(y). 
kK Ké kK KO 
In fact S(0,y) = 7 ea C(é,y) = = a 
and K = 49 93(0,7) = b*(y). 


There are proofs of these formulae, by direct squaring of series, in Jacobi’s 
Fundamenta nova and Enneper’s Hiliptische Funktionen. It is very unlikely 
that Ramanujan had seen either book, but this type of argument was 
familiar to him. 


The modular equation of degree 3 


12.4. The modular equation of degree » is the relation between the 
moduli & and / which corresponds to the change of ¢ into g"”,* Le. to 


gees 
LK? 

where K, K', L, L’ are the complete elliptic integrals with moduli & and I. 
Ramanujan deduced his modular equations from relations between his 


functions 7, ¢, y, that is to say from relations between }-functions. Thus 
his forms of the equation of degree 5 are derived from the identities 


PO-SP) = fh aPIG,¢)? 
and VO-W@) =fGNIGP)3 
The first of these, in the orthodox notation, is 
O20, 7) -03(0, 57) = 4e7*rO,( — 27, 57)95(—7, 57). 


* Or into g", when L'/L =nK'/K. See § 12.7. 
+ 19.10 (iv). 3 19.10(v). 


(12.4.1) 
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The identities are proved by “elementary” arguments, the whole process 
being “algebraical”’. 

I shall confine myself to the equation of order 3, and I shall begin by 
explaining two of the standard methods of proof. The first rests on direet 
transformations of integrals, and is Jacobi’s version of Legendre’s original 
proof. The second, found mueh later by Schréter, is much more like 
Ramanujan’s. 

12.5. Jacobi’s argument is oxpounded in many text-books. We try to 
satisfy the equation 


mdy — dx 
(12.5.1) TW > Je" 
where X = (1~ 2) (1~ kx’), Y = (1—y*) (1-/y’), 
O0<k<1, O0<i<1, 
by taking 
12.5.2) a 
(12.5, y= p> 


where U and FV are coprime polynomials in a of degree not exceeding 3, 
and y vanishes with x and is positive for positive x. It is plain (sinee y is 
an odd function of x) that 

U =a(at+ £2), V=y+de?, 
where a, 8, y, 6 are eonstants. 


dy _ JSS 


‘i aie 
Neat Te (1 — 2) (1 — ba?) =o 


for small x, so that m is positive; and y = 1 when x = 1, since y would 
otherwise have a braneh point for 2 = 1. Further (12.5.1), and so (12.5.2), 
is unaltered when we substitute 1 /ka and 1/ly for x and y, so that the values 


di? 31 
y¥= L-Lpr-q 


correspond to g=1,-1, a -; 
: dy _ uy-UuVv’ 
Since Jy = [Py (ri Boy de, 


wemusthave (V?—U2)(V2—PU?) = T%(1—2%) (1-2), 
where T is a quartic; so that 
dy _ U'V-UV’ 
VY PAX 


UV-UV' = a 
m 


dz, 
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Next, since no two of ViU, V+IU can have a common factor, we 
must have VtU =(1+a)A%, V+lU = (1+ ke) 0%, 
V—-Us=(i-2)B’, V-—IlU = (1-ka) D, 
where A, B, CO, D are linear. It follows (since y is odd) that 
l-y (Ge) 


Ity 1l+a\l+ca}’ 
_ 2(2¢+1+ cx?) 
of. y= 1+c(c+ 2) 22 ’ 


for some ¢. This equation must be unchanged if we substitute 1/kx:, 1/ly 


for x, y, when it becomes 
_ ka c(e+2)+ hea? 
Y= T+ (Qct1) Px 


e+ 2) ay (ga) 


and therefore 


(12.5.3) B= 


2e+1 2c+1 
We obtain a relation between & and J by eliminating c. In fact 


. . (l+¢e8(l—c) io yg l-c\§ 
(12.5.4) W¥=1-e= "pe, Mell =a+a(s—sy, 
and so 
(12.5.5) fel) tafe) ee i 


~ 241 2641 
when appropriate values of the radicals are taken. This is Legendre’s form 
of the relation. It is easy to deduce Jacobi’s form 

(12.5.6) ut —vt+ Quv(1—u?v?) = 0, 
where k = uf and] = v4, 

Comparing the values of x and y for small ~, we see that 
“eee 

~ 2641" 

12.6. The equations (12.5.3), (12.5.5), and (12.5.6) are in fact different 
forms of the modular equation of degree 3; but in order to see this, we 
must show that they correspond to (12.4.1), with 2 = 3, as well as to 
(12.8.1). 

It is easy to see’ that, if 0</?<1, then the equation 
Da gae2 

2e+1 


(12.5.7) 


(12.6.1) Be 


* By means of ‘graphical’ considerations and the equation 


wef 2041\2 
an arr me 
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in c has just two real roots, one between 0 and I and one between —2 
and —1, and that these roots make l?>k® and (?<k? respectively. If, in 


the equations (12.5.4), we put 
l-c _ l-y 


(12.6.2) rr Pope 
then 

22 y+2 . 12 3 Yt2 

(12.6.3) I a Oa , 2b y ae 


Tt now follows from the analysis of § 12.5 that 


1 da aA dy 
m’ (1-2) (1-1?2%)} ~ (9?) (1-9) 
1 


’ 


with m 


ape t 
Hence, integrating between 0 and 1, 
ae 
m 
while (12.5.1) gives mL= K, 
L x 
Thus Te =mm ia 
and it is only necessary to show that 
mm’ = 4, 
But mm = on 
~ (26+1) (2741) 
3 pF 
and by + l= ga mm' = %, 
by (12.6.2). 
12.7. The equation (12.5.5) is symmetrical in & and J, and corresponds to 
lees 
LK 


(ie. to the change of g into g") as well as to (12.4.1). We can see this more 


directly. If we put 
__e+2 e+2 


=~oeey © =~Berl 


in (12.5.3), we obtain ae 
wae(s 2) p aoe +2) 
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ie. (12.5.3) with c’ for c and & and l interchanged. Also c’ lies between —2 
and — 1 whenc lies between 0 and 1, and conversely, so that c’ is the second 
root of (12.6.1). If y’ ig associated with c’ as y is with c, then 


(1420) (142c')=—38, (14+ 2y)(142y") = -3, 


and so (14+ 2c’) (1+ 2y’) = 3 
LE! &' 
and LE = 35 . 


12.8. The second method of proof depends on direct manipulation of 
theta-series. The equation (12.5.5) is equivalent to 


(12.8.1) (0,7) ,(0, 37) — 9,(0, 7) 8,(0, 37) = 9,(0, 7) 9,(0, 37) 
or to 
(12.8.2) QE gieBn? Sigler WP Be+1? 


where S indicates summation over all integers and 2” summation over all 
integers of opposite parity. If now we put 


M+N=U%, M—-N=>2, 
so that 


2 —y\2 
m*+ 3n? = () +3(" *) =w—uvt+e? = (u—dv)?+ 3(40)*, 


2 2 
then the left-hand side of (12.8.2) becomes 


Des gi P4300)? 
u,vodd 


Next, if we put u—jv=p+h, v=vt+sh, 
so that Us ftv+dl, v= 2v+), 
and let » and v run through all values of opposite parity, then % and v run 
through all odd values, and so 


ha)? 
2 o do)t43(}0)? — Dy! Qe tPBeTy? 
U;VO 


where the dash has the same meaning as before. Finally, if 


t 


fo =p], Be Bs 
then (a! + 3) +300 +S)? = (w+ 4)? +3(v 4 5)%, 


and p’ and v' run through all values of the same parity. Hence 


2 
—})2. 4)2 32. 2 1h 
DY ger D807? = 2 pay gi?) +2044)? — 9 y gr ant 


—@ 


which is (12.8.2). 
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12.9. This proof is much like those given by Schroter of the equations 
for this and certain higher degrees. Schréter found the general identity 


(12.9.1) 8,(2, ar) d3(y, &r) 
atf-1 F 
= D> gPremastntytrar, (a+ B)7}O,{be—ay+rapr, aB(a+ B)r}, 
r=] 
where « and # are any positive integers. There is a fairly simple proof of 
this formula in Tannery and Molk, where it is applicd to the cases n = 3 
and n = 5. Schréter also worked out the cases 11, 23 and 31; thus his 
form of the equation of degree 23 is 
(kL)t+ (kV) + Qk) = 1, 
which also appears in the notebook.’ It would seem that Ramanujan must 
have known (12.9.1). He has a formula? for 
f(a, O)f(e, 4), 
ie. for 3,(v,7)9,(v’,7'), as the sum of an infinite series, which was also 
found by Schréter, and in certain cascs this series becomes finite. “These 
compact formulae’, says Watson, “were not given in the notebook, but 
in view of the numerous special cases which he does give, he must have 


been aware of their existence.” 
Ramanujan gave the known forms for 


3, 5, 7, 11, 23 
(with many variants in the simpler cases), and new forms for 
13, 17, 19, 31, 47, 71. 
Thus the equation of degree 13 appears as} 


Ly, ay er 

“ (;) H(p kk’ kk)? 

13 _ (ky (R\t_ (Ret aay 

(7) +(7) a aa 

and that of degree 17 as4 

We (yt (W\t of Wes, (Aye (UE 
m= (2) +z) +(e) -+(z) [+() +(e)} 
17 [k\t (k’\3 [hi\t  (kek'\* 4) (5) 
mo (a) +(e) Ge) Gr) (7) +e) 


Watson has now worked out proofs of all of these. He remarks that 


“although modular equations of degrees 13, 17 and 19 have been obtained 


* 90.15 (i). 2 16.36 (i), (ii). 
3 20.8 (iil), (iv). + 20.12 (iii), (iv). 
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by other authors, they have usually been given in more complicated forms, 
and in fact, when dealing with Ramanujan’s modular equations generally, 
it has always seemed to me that knowledge of other people’s work is a 
positive disadvantage in that it tends to put one off the shortest track”’, 
There are also many ‘‘mixed” modular equations or equations of com- 
posite degree. For example, if the moduli &, x, J, A correspond to gq, q°, 5 
and g!® respectively, then* 
(KAJ — (R’A'}E = (xl)? = (aT) 
and? 
(HA) {(1 +2)# (1+ A}t— (1 — &)t (1 - A) 
+H(R'A') ie, R(IL+A/-(1—# yt AY = 4/2. 
I shall say no more about results of this kind. I bate to disentangle 
from the notebook Ramanujan’s own proof of the equation of degree 3, 
which appears there as the climax of a very intricate chain of reasoning. 


It is of course quite likely that he possessed a shorter proof on the lines 
of $12.8, 


12.10. Ramanujan actually deduces (12.8.1) and (12.5.5) from two - 


identities} involving series of the “Lambert” type, viz 


7 A g rt: g ‘ q = git 
(12.101) g@)Wa") = 72a-ys aot pega 


(12.10.2) ota) da") = 142( 2-2 e +.) 
q 1+q 1t+q* 1-9 
the signs in each series recurring to modulus 6. From these it follows that 
(12.10.38) 4gyr(g?) ¥(@*) = b(g) 6(9°) — 6( — 9) #(-9°);# 
and, since 
P(g) = 95(0,7), $(—9) = 94(0,7), YG?) = $9-*9,(0,7), 
(12.10.3) is equivalent to (12.8.1). It remains to trace the genesis of (12.10.1) 
and (12.10.2): this I have only been able to do with the help of Watson’s 
solutions. 
L call a relation between f, ¢, yr “trivial” if it is a direct consequence of 
their expressions as products. Thus 


f(a-7) (9) 
12.10.4 — = 
vege?) A-ae) ~ 6°) 
is “trivial”. For if we write 
1—q* ee (n), l+q" = (m), 


t 20.11 (iri). ? 20.11 (vi). 3 19.3 (i), (i). 
4 This formula does not occur explicitly in the notebook. 


Elluptie and modular functions 221 
then it is 
(1) (4) (7) (10) (2) (5) (8) (11)... (2) (4) (6)... (2 B)P@)... 
(1) (4) (7) (10) ... (2) (5) (8)(11)... (8) (12) (18)... (3)2 (9)? (15)? * 


Such a formula may be verified by observing that 


— 


—, _ (2) 
ay 
and comparing factors. 
Similarly 
az10s) FEL gat am) = aia vee 
is “trivial’’. 


12.41. The proofs of (12.10.1) and (12.10.2) may be traced back to earlier 
formulae, viz. 


(12.11.1) ee”) 52-8) =) | 


a+b a+b? ) 
i= 


Tab I¢@pet 
and 


(1211.2) f%(a,b) fa, = 0) = dag (= ,a%b) yratbt)s 


Let us assume these formulae provisionally. The series in brackets in 
(12.11.1) is 
oO ar +hn 


ao wo 
— 1)” fqim+in gn _. gn (m+n 
nail + T+arb" = 2, Z A ) { } 


o qmtlgm ampmt 


Hence (12.11.1) is equivalent to 


(12.11.3) LOO oH ab)=1+2 = ( 1 ( 


qmtipm ampmtl 
l— gmt bm ]—gmpmtiy* 


If we put a = g, b = —q?, and use (12.10.4), then (12.11.38) gives (12.10.2). 
It also follows from (12.11.3) that 


{fe —b) is6 a, ka $(ab) = 4 > E ( qmtipm am pmsl 


I( —a, b) T( : T= “gim+2 52m 1 —q2mpante 
If we transform the left-hand side by (12.11.2) and the “trivial” identity 
f(a, —6) f(a, 6) = f(-@, —b) p(ab 


T 16.33 (corollary). 
3 16.30 (vi). 
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we obtain " 
aaf(—2, —a% 

f(a, —6)—f2(-4@, 6) = a 2ab age 
(Eu ecu 


saf( a —a'b) 
= F(-@, 8) dab) yr(ab?). 
Hence 
b 
af seeareeT Seas ab el m+1 pm qm pmti 
sleet ¢(ab) y(a°b?) = 3; tip = j= a) = 


Finally, if we replace a and 6 by g and g®, and use (12.10.5), we obtain 


/ 0 6m+1 Bm+5 
ay (9?) 49") = 3 (4 ae~ a) , 
which is (12.10.1). 
Thus everything is reduced to the proof of (12.11.1) and (12.11.2). 


12.12. The first of these formulae is equivalent to 


sd _ 1 940, 7)94(v+ 3,7) 
(12.12.1) ED py gen 008 2? = F507) 8500+ 4,7)" 


Tannery and Molk prove formulae of this type by means of Cauchy’s 
theorem. We can deduce (12.11.1) from (12.12.1) by putting 


3 
que =(ab)}, z= er = (5) : 


and then making a “‘trivial” transformation. Ramanujan’s proof is natur- 
ally very different. He deduces (12.12.1) from a remarkable formula* 
with many parameters, viz. 


(12.12.2) 
IT (xy, x*) nf{E,24) IT( —2, x?) IT( —afx?, x?) 


& 1+ (yay 4) 
T(axy, 2%) n{E* 2) II( —ax?, 22) I — Bx®, x2) 1—fx®  yl—an? 


slow Besos Conia 


(.— fa) T= frt) *\y) Cant) (1a) 
_ (=a) (ata) (za) 
+ (cou (— 2?) qa — fx) qa — px) 
160% 


tou) toe 
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This formula seems to be new. It is however deduced from one which is 
familiar and probably goes back to Euler, viz. 
IT(b,x) a+b | (a+6)(a+bx) , 


if(—a,a) I—x (1—x)(1—a8) 7" 
In the application of (12.12.2) to the proof of (12.11.1) 


x 
sy=a, -=b, a=f=-1, 
y 7 B 


12.13. Finally (12.11.2) is equivalent to 
S3(v, T) —O4(v, 7) = 29,(0, 27)9,( 20, 27), 
which is derived by Tannery and Molk from the theory of Landen’s quadratic 
transformation. Ramanujan deduces it from 


Fa, B)fle4)—f(—4, ~D)f( 6, ~€) = 2af (2,5 aboa) #(5,$ abcd), 


where ab = 
The direct proof of this formula is quite simple, if we write the left-hand 
side as 2 > qimm+l) pim(n—-} gin(n+l) gintn-2) 
: m+n even 
and put min=2M, m—-n=2N, 


when the summation is over all integral M,N. 


12.14. It is obvious that the argument of §§ 12.10-12.13, regarded simply 
as a proof of Legendre’s equation, is very long and complicated, and will 
not bear comparison with the economical proofs of §§ 12.5-12.8. The 
comparison, however, is not at all a fair one. The proof of§ 12.8, for example, 
is an ad hoc proof, a short cut to a particular formula; and it seems almost 
certain that Ramanujan must have possessed short cuts of the same kind. 
He certainly uscd similar methods sometimes (as is shown by the argument 
which I quoted in § 12.18), and the proof of § 12.8 is one which, whether he 
knew it or not, he could certainly have constructed. The proof which I have 
pieced together in §§12.10-12.13 is one of an entirely different kind; it 
wanders over large parts of the theory of elliptic functions, and the modular 
equation appears only as one of the climaxes of Ramanujan’s reconstruction 
of the theory. 

Singular moduli 

12,15. I shall say less about Ramanujan’s work in the closely connected 
theory of “singular moduli”. A good deal of this work is already in print;3 
and Watson, in his series of papers on singular moduli, has proved all 


« 16.2. There is a simple proof of this formula in the Papers, no. 11, 57-58. 
* 16.29 (it). 3 Papers, no. 6, 23-39. 
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Ramanujan’s results, and done a great deal to elucidate his probable 
method of procedure. I begin with a very short summary of the foundations 
of the orthodox theory. 

lt is familiar that 


(12.15.1) (Me, 1, @,) = R(g), 
where @ = O(4) = PU, Oy, &2), 


and Ris arational function, when wisaninteger. Can this be true for other 4? 
If so, then 2uw,, 2, are periods of (wu), and 


Boy = 0 + Big, Ul, = YO1 +d», 


where a, f, y, 6 are integers with 


(12.15.2) B+0, yO, ad-—fy$0. 
We may suppose 
(12.15.38) p>0 


(sinee otherwise we ean change the sign of a, f, y, 6 and x). Conversely, 
when these conditions are satisfied, fo() is an elliptie funetion with periods 
@,, ©, and so a rational funetion of @ and ¢’. Finally, since it is even, it is 
a rational funetion of g only. 

The situation is not affected if we replace w, and w, by Aw, and Aw,, so 
that only the ratio 
T = 04/W, 
is relevant. In these circumstances we shall say that “there is eomplex 

multiplication by » for 7”. For this, it is necessary and sufficient that 


(12.15.4) paatfr, pr=yt+sr, 
so that 
yidr 
2.15.5 = 
u Z . at Br 


with integral a, £, y, 6 satisfying (12.15.2) and (12.15.3). Thus 7 is the root, 
with positive imaginary part, of 


(12.15.86) frt+(a—d)r—-y = 0. 

It is easily verified that any number 4 +By, where A and B are integers, 
is a multiplier for 7 if y is one. 

12.16. We write 

(12.16.1) p=(~-y,a—6,f)>0 
and 

(12.16.2) —y=ap, a-d=bp, f=ep. 


} 
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Then 
(12.16.3) a+br+cr* = 0, 
where 
(12.16.4) a>0, ¢>0, B~4ac=—n<0, 
and 


(12.16.5) a a 


2c 


Further, we write 

(12.16.86) a+d = py. 
Then py +bp = 2a=0 (mod 2), 

(12.16.7) a= $(p,+0p), B=ep, y=—ap, 6=Hp,—bp), 
and 

(12.16.8) B=atfr = hptipdn), 

Conversely, if there are integers a, 6, ¢ satisfying (12.16.4), and p, p, 
satisfying 

(12.16.9) p>0, p,+b9=0 (mod 2), 


’ ot, B, y, 6 are then defined by (12.16.7), 7 by (12.16.5), and z=a+/37, then 
there is complex multiplication by y for 7. 


Further, if 
(12.16.10) e=0(6 even), e=1 (odd), 
(12.16.11) M = }(-e+i,/n), 

then M is also a multiplier for 7. For it is easy to verify that 
(12.16.12) p= a—b-e)p + pM, 


(12.16.13) M = (b-—e)+e7, Mr=—a—d(b+e)r. 
Since the last equations are of the form (12.15.4), Mis a multiplier (when 
fis one). 

On the other hand, if M is a multiplier for 7, it follows from (12.16.12), 
and the last remark of § 12.15, that wv is a multiplier. 


12.17. If 7 satisfies (12.15,.5), A, B, C, Dare integers with AD— BC = 1, 
and 


C+ Dr 
(12.17.1) tage, 
so that 
(12.17.2) J(7') = J(r), 
then 7’ satisfies an equation 
(12.17.38) a’ +6'r' +¢'7?%=0 


HR 15 


t 


< 
i 
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with the same determinant as (12.16.3), so that b’ has the same parity as 8, 
and the e’ and M’ corresponding to 7’ are the same as ¢ and M. It follows 
that there is multiplication for 7’ by M, and therefore by any which is a 
multiplier for 7. The values of 7 for which there is multiplication, and the 
corresponding multipliers, are determined uniquely by the values of J(r). 


12.18. If 7 and any positive integer m are given, the values of 


for different a, f, y, 6 satisfying 
ad —~ fy =m, 
are the roots of an equation F(x, J) = 0, 
where J = J(7), 
of degree m. If 7 satisfies (12.15.5), then 
(12.18.1) FJ, J) = 0. 


Hence each J(r) corresponding to a 7 for which there is complex multiplica- |’ 


tion satisfies an algebraic equation. We call these J(7) singular invariants, 
and the corresponding 4°(7) singular moduli; the singular moduli also satisfy 
algebraic equations. These equations are all soluble by radicals (though 
this depends on considerations of group theory of which Ramanujan knew 
nothing). 


12.19. All this is connected with the theory of arithmetical forms. 


Suppose that faba op, 


where (a,6,c) =1, 
is equivalent to fl sa’c? +b'x'y'+0'y"?, 
with e=antfy, y =ye+dy, ad-fy =], 
so that 
(12.19.1) 


a=aoe+tbayteo'y, b = 2a’aftb’(Py+ad)+2c'yd, c= a'f?+b'Pb+e'd, 
(a’,b’,¢’) =1, 
b? —4ae = 6" —da’e’ = —n. 
Then 6 and b’ have the same parity and, if 7 and 7’ are roots of 


(12.19.2) atbr+er?=0, a +b’r'+0'r? = 0, 


4 
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the M of § 12.16 has the same value for 7 and 7’. Thus there is complex 
multiplication by M for both 7 and 7’. Also 7 and 7’ satisfy 


(12.19.8) TET (ab fy = 1) 
and so 
(1219.4) I(r’) = dn). 


Conversely, suppose that (12.19.4) is true, that there is complex multi- 
plication for 7 and 7’, and that 7 and 7’ satisfy equations (12.19.2), with 
(a,b,c) =1 and (a’,b’,c’) =1. Since (12.19.4) is true, there are integers 
a, 8, y, 6 satisfying (12.19.3). Hence the equation 


a (a+ Br) +b’ (a+ fr) (y +67) +c (y+ or = 0 
has the same roots as at+br+or = 0, 


From this it follows easily that a, b, c satisfy (12.19.1), and that the forms 
fand f’ are equivalent. 

Thus (12.19.4) is a necessary and sufficient condition for equivalence, 
and the degree of (12.18.1) is the class number /(—7) for determinant —. 
-- It seems clear that Ramanujan knew nothing about the arithmetical 
side of the theory. 


12.20. The theory appears in the clearest light when stated in terms 
of J(r). For numerical work, other modular functions are more convenient, 
since the equations satisfied by J(7) tend to have large coefficients. 

It is usual to write 


9 »n_{4 ty 
jo) = oT (+a) = (ge) > 
2 4h?) ts 
ft) =the) = (FE) 


,2 ay. (SV 
jae) = Bah Thee) =(Fr) 


Then f= Sitti fife =v? 
If n = 4ac—b? = 8m+3 40 (mod 3), 
then the simplest invariant is 
fW-o}=hah 
while if n= 8m—140 (mod 3), 
it is 2 {(—n} =F, =F. 


T§°2 


oy 
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Ramanujan, however, uses the invariants 
G, = Ff {i(—n)} = 2-4, (n odd), 
Gn = 2-*f,(y(—m)} (nv even). 
The equations connecting f, f, and f, with J(7) are 


he + ks) 
_ (f4—16) _ (f#+16)? _ (ft +16) 
“areas i, eae Tis 


and f%, —f%, —f3! are the three roots of 
(w—16)8—aj(r) = 0. 
42.21. Ramanujan obtained the values of the simplest of his singular 


moduli directly from modular equations. He gives the complete proof in 
one case only, viz. n = 5. Here 


a= 7-9) = (sen) 


and G, = Gs 
(from the transformation 7’ = — 1/7). 
If u = 2-tf(r), v = 2-4f(5r), 
3 3 
7 eff) -afee-c) 
v u uy’ 


This is Schlafli’s form of the modular equation of degree 5, which occurs 
also in the notebook.t If 7 = i/,/5, q = e-/V5, then 


U=Q, vH=G, v=4, 


15 t 
_ (vot1 
and Ga; = ( 5) ) : 
Similarly, if we use the form? 
Q+agt = 2(P +5), 


Q P 
; i’ \t 
where PH2HkkW), Q= (we 
of the modular equation of degree 7, we can show that 


G, = 2. 
Watson, in his paper 20, works out the values of all of Ramanujan’s new 
singular moduli which it seems likely that he could have found in this way. 


¥ 19.13 (xiv), 2 19,19 (ix). 
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12.22. There are also many cases in which Ramanujan gives the values 
of singular moduli which he can hardly have determined rigorously. 
Let us suppose, to fix our ideas, that 


n= 8m—140 (mod 8). 


If the number of genera of classes of forms with determinant —n is NV, and 
the number of classes in each genus is 7, so that the class number is h = Nr, 
then j(7) and I, satisfy equations of degree h, soluble by radicals. The 
solution of the equations depends on that of a number of subsidiary 
equations, the number and degree of these equations being specifiable in 
terms of ¥ and r. Thus when n = 7, N =1,7 = 1, and # = 1 is rational, 
and when » = 23, V = 1, r = 3, and Ff, is given by a single cubic 


—F-1=0., 


In Watson’s paper 13 he deals with cases in which N =1, in 26 also with 
cases in which V=2 or N=4, When N = 2, two equations are required. 
Thus if n = 55, N = 2,7 = 2,h = 4, F satisfies the quartic 


Ft —2F84 F—-1=0 


_ 3 +25 

4 

by the adjunction of 5. Ifn = 455, N = 4,7 = 5, h = 20; and F satisfies 
an equation of degree 20 which can be reduced to a quintic by the adjunction 
of j8 and /13. 

Ramanujan did not know any of this arithmetical theory, and, when he 
was dealing with large values of 2, where no modular equation was available, 
must have been guided a good deal by guesswork and intuition. Watson, 
in his papers 8, 9, and 19, has made a very plausible reconstruction 
of the arguments which Ramanujan seems to have used. These all involve 
assumptions that certain numbers satisfy equations of appropriate degrees. 
In these assumptions, which always accord with the arithmetical theory, 
he seems to have been guided by a mixture of intuition and computation. 

Thus in 8 Watson reconstructs Ramanujan’s calculation of go (a 
value found also by Weber), and deduces one of the most striking of 
Ramanujan’s results, viz. that 


lb = (2-1)? (2 ~,/8) (¥7— f6)® (8— 3.4/7) (10 — 8) (4/15)? 
x (¥15—4/14) (6 —,/35) 


which reduces to (F-4)? 


when q= en7v 210, 


This result Ramanujan proved rigorously, granted the value of gor9, by a 
very curious algebraical lemma. 


f 
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NOTES ON LECTURE XII 


§ 12.1. The catalogue referred to on p. 20 (note on p. 10) contains Cayley, Greonhill, 
Tannery and Molk, A. C. Dixon’s The elementary properties of elliptic functions, Appell 
and Lacour’s Principes de la théorie des fonctions elliptiques, and the first edition of 
Whittaker’s Modern analysis. It is however clear that Ramanujan had not read any 
account of elliptic functions based (like those in Tannery and Molk or Whittaker) on 
general function theory. 

The quotation from Littlewood is from his roview of the Papers. 

§ 12.2. There are proofs of Jacobi’s identity, and deductions of thoso of Euler and 
Gauss, in Hardy and Wright, §§ 19.8-9. 

$12.3. See Watson’s lecture 10, Jacobi, FPundamenta nova, §41, and Enneper, 
Elliptische Funktionen, § 13. 

The last sentence is justified by the proof (from no. 18 of the Papers) reproduced in 
§ 9.2. 

§12.5. See Cayley, chs. va and vu (especially pp. 188-190), Greenhill, ch. x 
(especially p. 323). There is a better account of the general theory in Enneper, cli. 1x 
(especially pp. 225-237), but Jacobi’s treatment of the special casen = 3 is stated very 
shortly in an appendix (p. 518). 

§ 12.9. See Tannery and Molk, 1i, 163-166, and iv, 230-231, 242-244. 

§ 12,12. For (12,12.1) see Tannery and Molk, ili, 120-134 and iv, 105 (Tablo CVITI). 

(12.12.2) reduces to Jacobi’s formula quoted in § 12.2 when a= £ = 0. a 

§ 12.13. Sce Tannery and Molk, ii, 114-119 and 268 (Table XLVII). 


§§ 12.15-19. My sketch of the orthodox theory 1s based on Tannory and Molk, iv, 
254-260. 

§12.19. The classnumborliercisthe nuraber of primitive classes of determinant —7; 
we have supposed throughout that (a,0,c) = 1. The whole argument 1s based on 
Kronecker’s theory of forms ax? + bry + cy*, in which 6 is not necessarily oven and the 
determinant is 6?—4ac: thus #74 7y? has determimant — 28. In the older theory of 
Gauss, the form is ax?+2bxy+ cy? and the determinant 6?—ae, so that 27+ 7y® has 
determinant —7. 

Ihave in my possession a letter to Ramanujan from Professor W. E. H. Berwick 
which throws some light on the last sentence of this section. Professor Berwick, after 
acknowledging the receipt of Ramanujan’s paper “Modular equations and approxi- 
mations to 7” (no. 6 of the Papers), and giving a number of references, explains that 
J satisfies an irreducible equation of degree h, and that k* can sometimes, but not 
always, be expressed rationally in terms of J. In particular he points out that J(i./257) 
and £(¢./257) satisfy equations of degree 16 soluble by chains of four quadratics. 

$12.20. See Watson (13). 

§ 12.21. The proof for n = 5 is given in no. 6 of the Papers. 


§ 12.22. All the cxamples are taken from Watson’s papors. 
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